INEQUITIES IN THE SHANKS-RENYI PRIME NUMBER RACE: AN ASYMPTOTIC 

FORMULA FOR THE DENSITIES 
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Abstract. Chebyshev was the first to observe a bias in the distribution of primes in residue 
classes. The general phenomenon is that if a is a nonsquare (mod q) and 6 is a square (mod q), 
then there tend to be more primes congruent to a (mod q) than b (mod q) in initial intervals of the 
positive integers; more succinctly, there is a tendency for tt{x\ q, a) to exceed tt{x; q, b). Rubinstein 
and Sarnak defined 6{q; a, b) to be the logarithmic density of the set of positive real numbers x for 
which this inequality holds; intuitively, 5{q;a,b) is the "probability" that TT{x;q,a) > Tr{x:q,b) 
when x is "chosen randomly". In this paper, we establish an asymptotic series for S{q; a, b) that can 
be instantiated with an error term smaller than any negative power of q. This asymptotic formula is 
written in terms of a variance V{q; a, b) that is originally defined as an infinite sum over all nontriv- 
ial zeros of Dirichlet L-functions corresponding to characters (mod q); we show how V{q; a, b) can 
be evaluated exactly as a finite expression. In addition to providing the exact rate at which S{q; a, b) 
converges to ^ as g grows, these evaluations allow us to compare the various density values S{q; a, b) 
as a and b vary modulo q; by analyzing the resulting formulas, we can explain and predict which of 
these densities will be larger or smaller, based on arithmetic properties of the residue classes a and 
b (mod q). For example, we show that if a is a prime power and a' is not, then 5{q] a, 1) < S{q; a' , 1) 
for all but finitely many moduli q for which both a and a' are nonsquares. Finally, we establish rig- 
orous numerical bounds for these densities S{q; a, b) and report on extensive calculations of them, 
including for example the determination of all 117 density values that exceed 
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1. Introduction 



We have known for over a century now that the prime numbers are asymptotically evenly dis- 
tributed among the reduced residue classes modulo any fixed positive integer q. In other words, 
if 7r(x; g, a) denotes the number of primes not exceeding x that are congruent to a (mod q), then 
lim^_j.oo 7r(x; g, a)/Tx{x] q,b) = I for any integers a and b that are relatively prime to q. However, 
this information by itself is not enough to tell us about the distribution of values of the difference 
7r(x; q, a) — 7r(x; q, b), in particular whether this difference must necessarily take both positive and 
negative values. Several authors — notably Chebyshev in 1853 and Shanks [fT5l in 1959 — observed 
that 7r(x; 4, 3) has an extremely strong tendency to be greater than 7r(x; 4, 1), and similar biases 
exist for other moduli as well. The general phenomenon is that 7r(x; g, a) tends to exceed n(x;q, b) 
when a is a nonsquare modulo q and 6 is a square modulo q. 

In 1994, Rubinstein and Samak [|T4l developed a framework for studying these questions that 
has proven to be quite fruitful. Define 5(g; a, b) to be the logarithmic density of the set of real 
numbers x >\ satisfying 7r(x; g, a) > Tr{x; g, b). (Recall that the logarithmic density of a set S of 
positive real numbers is 



or equivalently the natural density of the set {logx: x G S}.) Rubinstein and Sarnak investigated 
these densities under the following two hypotheses: 

• The Generalized Riemann Hypothesis (GRH): all nontrivial zeros of Dirichlet L-functions 



• A linear independence hypothesis (LI): the nonnegative imaginary parts of these nontrivial 
zeros are linearly independent over the rationals 

Under these hypotheses, they proved that the limit defining 5{q; a, b) always exists and is strictly 
between and 1. Among other things, they also proved that 5(g; a, b) tends to | as g tends to 
infinity, uniformly for all pairs a, 6 of distinct reduced residues (mod g). 

In the present paper, we examine these densities (5(g; a, b) more closely. We are particularly 
interested in a quantitative statement of the rate at which S{q;a,b) approaches ^. In addition, 
computations show that for a fixed modulus g, the densities 5{q; a, b) vary as a and b range over 
nonsquares and squares modulo g, respectively. We are also interested in determining which pairs 
a, b (mod g) give rise to larger or smaller values of (5(g; a, b), and especially in giving criteria that 
depend as directly as possible on a and b rather than on analytic data such as the zeros of Dirichlet 
L-functions. 




l<x<X 



have real part equal to | 
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Our first theorem, which is proved in Section 



2.4 



exhibits an asymptotic series for S{q; a, b): 



Theorem 1.1. Assume GRH and LI. Let q be a positive integer, and let p{q) be the function defined 



in Definition L2 Let a and b be reduced residues (mod q) such that a is a nonsquare (mod q) and 



b is a square (mod q), and let V{q\ a, b) be the variance defined in Definition L3 Then for any 
nonnegative integer K, 

K , e 



6{q;a,b) 



+ 



^/27rV{q-a,b)j^^V{q;a,b) 



j=0 



2K+3 



V{q- a, by<+^/^ 



(1.1) 

where the real numbers Sq-a,b{^,j), which are bounded in absolute value by a function of i uni- 
formly in q, a, b, and j, are defined in Definition 2.23 In particular, Sq-a,b{0, 0) = 1, so that 

' P{<1) , r^f P{.<lf 



S{q;a,b) 



+ 



y/27rV{q;a,b) 



+ 



V{q;a,by/^ 



(1.2) 



We will see in Proposition 3.6 that V{q; a, b) ~ 20(g) logg, and so the error term in equa- 
tion ([TT]) is <A'£ 

The assumption that a is a nonsquare (mod q) and 6 is a square (mod q) is natural in this context, 
reflecting the bias observed by Chebyshev. Rubinstein and Samak showed (assuming GRH and 
LI) that S(q; b, a) + S{q; a,b) = 1; therefore if a is a square (mod q) and 6 is a nonsquare (mod q), 
the right-hand sides of the asymptotic formulas ( |1.1[ ) and ( |1.2[ ) become | — ■ ■ ■ instead of | + ■ ■ ■ . 
Rubinstein and Sarnak also showed that 6{q; b, a) = 6{q; a, 6) = | if a and b are both squares or 
both nonsquares (mod q). 

The definitions of p{q) and of V{q; a, b) are as follows: 

Definition 1.2. As usual, u{q) denotes the number of distinct prime factors of q. Define p{q) to be 
the number of real characters (mod q), or equivalently the index of the subgroup of squares in the 
full multiplicative group (mod q), or equivalently still the number of solutions of = 1 (mod q). 
An exercise in elementary number theory shows that 



p{(l) 



if2tg, 

if 2 I g but 4 f g, 
2'^(5), if4|gbut8tg, 
2^{<i)+i^ if8|g, 







which implies that p(g) g^ for every e > 0. 
Definition 1.3. For any Dirichlet character x (mod g), define 

L(l/2+i7,x)=0 

We adopt the convention throughout this paper that the zeros are listed with multiplicity in all such 
sums (though note that the hypothesis LI, when in force, implies that all such zeros are simple). 
For any reduced residues a and b (mod g), define 



V{q-a,b)= \x{h)-x{ci)\%{x)- 



X (mod q) 



We will see in Proposition 2.7 that V{q; a, h) is the variance of a particular distribution associated 
with the difference 7r(x; g, a) — it{x; q,b). <) 

As the asymptotic series in Theorem 1 1 . 1 1 depends crucially on the variance V{q; a, b), we next 
give a formula for it (established in Section |3j2]) that involves only a finite number of easily com- 
puted quantities: 

Theorem 1.4. Assume GRH. For any pair a, b of distinct reduced residues modulo q, 
V{q; a, b) = 20(g) {C{q) + K,{a~b) + i,{-ab-^) log 2) + 2M*(g; a, 6), 
where the functions C, Kg, and iq are defined in Definition 1.5 and the quantity M*{q;a,b) is 



defined in Definition 1.6 



The definitions of these three arithmetic functions and of the analytic quantity M* are as follows: 

Definition 1.5. As usual, 0(g) denotes Euler's totient function, and A(g) denotes the von Mangoldt 
function, which takes the value log p if g is a power of the prime p and otherwise. For any positive 
integer g, define 

£(4)=log«-E^ + ^-(7o + lo8-2-). 
^p-1 0(g) 

p\g 

where 70 = \im^_,^oo {J2n<x n ~ ^ogx) is Euler's constant; it can be easily shown that C{q) 
is positive when g > 43. Note that £(g) = log(g/27re'''o) when g is prime and that C(q) = 
log g + 0(log log g) for any integer g > 3. Also let 



1, if n = 1 (mod g), 
0, if n ^ 1 (mod g) 

denote the characteristic function of the integers that are congruent to 1 (mod g). Finally, define 

^ _ A(g/(g,n)) _ A(g) 
^ 0(g/(g,n)) 0(g)' 

Note that these last two functions depend only on the residue class of n modulo g. For this reason, 
in expressions such as L.q(n^^) or Kq{n^^), the argument n^^ is to be interpreted as an integer that 
is the multiplicative inverse of n (mod g). In addition, note that Kq{n) > 0, since the only way 
that the second term can contribute is if g is a prime power, in which case the first term contributes 
at least as much. On the other hand Kg is bounded above, since if g/(g, n) is a power of the prime 
p then Kq{n) < (logp) /{p — 1) < log 2. Note also that Kg{n) = when (n, g) = 1. <) 

Definition 1 .6. As usual, L{s,x) = Yl'^=i xii^)i^ '^ denotes the L-function associated to the Dirich- 
let character x- Given such a character x (mod g), let g* denote its conductor (that is, the smallest 
integer d such that x is induced by a character modulo d), and let x* be the unique character 
modulo g* that induces x- Now define 

M*{q;a,b)= - xWP^}^ 

X (mod q) I ' X / 

M{q;a,b)= ^1 " ^(^) ''wfr ' 

X(modi3) ^ ' '^'^ 

Xt^XO 
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and 





The formula for V{q;a,b) in Theorem 1.4 is exact and hence well suited for computations. 
For theoretical purposes, however, we need a better understanding of M*(q; a, b), which our next 
theorem (proved in Section [33] ) provides: 

Theorem 1.7. Assume GRH. For any pair a,b of distinct reduced residues modulo q, let ri and 
r2 denote the least positive residues of ab~^ and ba^^ (mod q), and let the quantity H{q; a, b) be 
defined in Definition 1.8 Then 

M- a» = m + ^ + (,; a. ft) + O (!^ 

\ n r2 \ q 

where the implied constant is absolute. 

(The unexpected appearance of the specific integers ri and r2, in a formula for a quantity de- 
pending upon entire residue classes (mod g), is due to the approximation of infinite series by their 
first terms — see Proposition 3.12 ) The quantity H{q] a, b) is usually quite small, unless there is an 
extreme coincidence in the locations of a and b relative to the prime divisors of q, which would be 
reflected in a small value of the quantity e(g; p, r) defined as follows: 

Definition 1.8. Given an integer q and a prime p, let z/ > be the integer such that p^ \\ q (that 
is, p" I q but p^'^^ \ q). For any reduced residue r (mod g), define e(g; p, r) = min{e > 1: p^ = 
(mod q/p")}, and define 

h( \ - ^ ^"^^ 

When r is not in the multiplicative subgroup generated by p (mod q/p"), we make the convention 
that e(g; p, r) = oo and h{q;p, r) = 0. Finally, for any integers a and b, define 

H{q; a,b) = Y^ {h{q; p, ab'^) + h{q; p, ba'^)) . 
p\g 

Note that if g = p'' is a prime power, then h{q; p, r) = (logp) /p'^ip — 1) is independent of r, which 
implies that H{q; a, b) <^ (log q)/q when g is a prime power. <) 

The extremely small relative error in Theorem ] 1.1 [ implies that the formula given therein is useful 
even for moderate values of g. The following corollary of the above theorems, the proof of which 
is given in Section |4.1[ is useful only for large g due to a worse error term. It has the advantage, 
however, of isolating the fine-scale dependence of 6{q; a, b) on the residue classes a and b from its 
primary dependence on the modulus g: 

Corollary 1.9. Assume GRH and LI. Let q > A3 be an integer. Let a and b be reduced residues 
(mod g) such that a is a nonsquare (mod g) and b is a square (mod g), and let ri and r2 denote the 
least positive residues ofab^^ and ba~^ (mod g). Then 

SW,a,b)='- + —J^^(l-^4p^ + 0(^]), (1.3) 
' ^ 2 2v/7r0(g)£(g) V 2£(g) Vlog^g;; 

where 

A(g; a, b) = K,{a - b) + L,{-ab-^) log 2 + + + H{q; a, b) (1.4) 

ri r2 

(here, the functions C, Kg, and tq are defined in Definition L5 and H is defined in Definition L8). 
Moreover, A(g; a, b) is nonnegative and bounded above by an absolute constant. 
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Armed with this knowledge of the delicate dependence of S{q; a, h) on the residue classes a and 
h, we are actually able to "race races", that is, investigate inequalities between various values of 
5{q] a, b) as q increases. We remark that Feuerverger and Martin [5, Theorem 2(b)] showed that 
5{q; a, b) = 5{q; ab~^, 1) for any square b (mod q), and so it often suffices to consider only the 
densities 5{q; a, 1). Some surprising inequalities come to light when we fix the residue class a and 
allow the modulus q to vary (among moduli relatively prime to a for which a is a nonsquare). Our 



next theorem, which is a special case of Corollary 4.3 derived in Section 4.2 demonstrates some 
of these inequalities: 

Theorem 1.10. Assume GRH and LI. 

• For any integer a ^ —1, we have 6{q; —1, 1) < 6{q; a, 1) for all but finitely many integers q 
with {q, a) = 1 such that both —1 and a are nonsquares (mod q). 

• If a is a prime power and a' ^ —1 is an integer that is not a prime power, then 6{q; a, 1) < 
6{q; a', 1) for all but finitely many integers q with {q, aa') = 1 such that both a and a' are 
nonsquares (mod q). 

• If a and a' are prime powers with A(a)/a > A{a')/a', then 6{q; a, 1) < 6{q] a', 1) for all but 
finitely many integers q with {q, aa') = 1 such that both a and a' are nonsquares (mod q). 

Finally, these results have computational utility as well. A formula [|5] equation (2-57)] for 
calculating the value of a, b) is known. However, this formula requires knowledge of a large 
number of zeros of all Dirichlet L-functions associated to characters (mod q) even to estimate via 
numerical integration; therefore it becomes unwieldy to use the formula when q becomes large. 

can be made completely effective, and the 



On the other hand, the asymptotic series in Theorem 1 . 1 



1.4 Therefore the densities S{q; a,b) can 



calculation of V{q; a, b) is painless thanks to Theorem 
be individually calculated, and collectively bounded, for large q. 

For example, the values of 5{q] a, b) for all moduli up to 1000 are plotted in Figure [T] The 
modulus q is given on the horizontal axis; the vertical line segment plotted for each q extends 
between the maximal and minimal values of a, 6), as a runs over all nonsquares (mod q) and 
b runs over all squares (mod q). (Of course both a and b should be relatively prime to q. We 
also omit moduli of the form q = 2 (mod 4), since the distribution of primes into residue classes 
modulo such q is the same as their distribution into residue classes modulo q/2.) 

The values shown in Figure [T] organize themselves into several bands; each band corresponds 
to a constant value of p{q), the effect of which on the density S{q; a, b) can be clearly seen in 
the second term on the right-hand side of equation \13) . For example, the lowest (and darkest) 
band corresponds to moduli q for which p{q) = 2, meaning odd primes and their powers (as well 



I I 



Figure 1 . All densities S{q; a, b) with q < 1000 



1 'I,' .'.hi 



' I > ' > I , I 
' i' li ' ^1 



y^'^^^^v >wij..-..^..i^, ;." ■■ ■•>■■■■. i. . . 



6 



as g = 4); the second-lowest band corresponds to those moduli for which p(g) = 4, consisting 
essentially of numbers with two distinct prime factors; and so on, with the first modulus q = 840 
for which p{q) = 32 (the segment closest to the upper right-hand corner of the graph) hinting at 
the beginning of a fifth such band. Each band decays roughly at a rate of l/^/q\ogq, as is also 



we are 



evident from the aforementioned term of equation ( |1.3[ ). 

To give one further example of these computations, which we describe in Section 5.4 
able to find the largest values of 5{q; a, b) that ever occur. (All decimals listed in this paper are 
rounded off in the last decimal place.) 

Theorem 1.11. Assume GRH and LI. The ten largest values of6{q; a, b) are given in TableQ 
Table 1 . The top 10 most unfair prime number races 



Q 


a 


b 


6{q;a,b) 


24 


5 


1 


0.999988 


24 


11 


1 


0.999983 


12 


11 


1 


0.999977 


24 


23 


1 


0.999889 


24 


7 


1 


0.999834 


24 


19 


1 


0.999719 


8 


3 


1 


0.999569 


12 


5 


1 


0.999206 


24 


17 


1 


0.999125 


3 


2 


1 


0.999063 



Our approach expands upon the seminal work of Rubinstein and Sarnak [fT4ll . who introduced a 
random variable whose distribution encapsulates the information needed to understand n(x;q,a) — 
7r(x; q, b). We discuss these random variables, formulas and estimates for their characteristic func- 
tions (that is, Fourier transforms), and the subsequent derivation of the asymptotic series from 
Theorem 1 1.1 1 in Section [2| In Section [3] we demonstrate how to transform the variance V{q] a, b) 
from an infinite sum into a finite expression; we can even calculate it extremely precisely using 
only arithmetic (rather than analytic) information. We also show how the same techniques can 
be used to establish a central limit theorem for the aforementioned distributions, and we outline 
how modifications of our arguments can address the two-way race between all nonresidues and 
all residues (mod q). We investigate the fine-scale effect of the particular residue classes a and b 
upon the density 5(g; a, b) in Section|4| we also show how a similar analysis can explain a "mirror 
image" phenomenon noticed by Bays and Hudson [2J. Finally, Section [5] is devoted to explicit 
estimates and a description of our computations of the densities and the resulting conclusions. 



including Theorem 1.11 
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2. The asymptotic series for the density a, b) 

The ultimate goal of this section is to prove Theorem ] 1.1[ We begin in Section [27T] by describing a 
random variable whose distribution is the same as the limiting logarithmic distribution of a suitably 
normalized version of 7r(a;; g, a) — 7r(x; q, b), as well as calculating its variance. This approach is 
the direct descendant of that of Rubinstein and Sarnak [|f4|: one of our main innovations is the 
exact evaluation of the variance V{q; a, b) in a form that does not involve the zeros of Dirichlet L- 



functions. In Section 2.2 we derive the formula for the characteristic function (Fourier transform) 



of that random variable; this formula is already known, but our derivation is slightly different and 



allows us to write the characteristic function in a convenient form (see Proposition 2.12). We then 
use our knowledge of the characteristic function to write the density 5{q; a, b) as the truncation of 
an infinite integral in Section [23| where the error terms are explicitly bounded using knowledge of 
the counting function N{T, x) of zeros of Dirichlet L-functions. Finally, we derive the asymptotic 
series from Theorem 1 1 . 1 1 from this truncated integral formula in Section 2.4 



2.1. Distributions and random variables. We begin by describing random variables related to 
the counting functions of primes in arithmetic progressions. As is typical when considering primes 
in arithmetic progressions, we first consider expressions built out of Dirichlet characters. 

Definition 2.1. For any Dirichlet character x such that GRH holds for x), define 



E{x,x)= I 



X 



17 



2+^7 



7eM 

L{l/2+i7,x)=0 

This sum does not converge absolutely, but (thanks to GRH and the functional equation for Dirich- 
let L-functions) it does converge conditionally when interpreted as the limit of as T tends to 
infinity. All untruncated sums over zeros of Dirichlet L-functions in this paper should be similarly 
interpreted. <0> 

Definition 2.2. For any real number 7, let denote a random variable that is uniformly distributed 
on the unit circle, and let denote the random variable that is the real part of Z^. We stipulate 
that the collection {Z^}^>o is independent and that = Z^; this implies that the collection 
{X^}^>o is also independent and that = X^. <) 

By the limiting logarithmic distribution of a real- valued function /(t), we mean the measure dv 
having the property that the limiting logarithmic density of the set of positive real numbers such 
that f{t) lies between a and (3 is du for any interval {a, (3). 

Proposition 2.3. Assume LI. Let {cy^ : x (mod q)} be a collection of complex numbers, indexed by 
the Dirichlet characters (mod q), satisfying = c^. The limiting logarithmic distribution of the 
function 

X (mod q) 



is the same as the distribution of the random variable 

2 E M E 



X(modg) 7>0 \/ 1 + 7^ 

L(l/2+i7,x)=0 V 



Proof. We have 



^ -|- 

X(modg) x(modg) |7|<T 2 



L(l/2+i7,x)=0 



X (mod q) ^ 0<7<T 2 ' -T<7<0 2 ' ' - 

L(l/2+i7,x)=0 L(l/2+j7,x)=0 



iv(.i/z-M7,x;=u 1.1^1/ z-i-?7,x;=u 

(The assumption of LI precludes the possibiUty that 7 = 0.) By the functional equation, the zeros 
of L(s, x) below the real axis correspond to those of L(s, x) above the real axis. Therefore 

E c,^(-,x)=^ii^m^ j: cj y: t— + E r— ) (2.1) 

X(modg) X (mod g) ^ 0<7<T 2 ' 0<7<T 2 '/ 

L(l/2+i7,x)=0 L(l/2+i7,x)=0 

= E E T— + E E I 



X(modg) 0<7<T 2 ' x(modq) 0<7<T 2 ' 

L(l/2+i7,x)=0 L(l/2+i7,x)=0 



Reindexing this last sum by replacing x by x> we obtain 

E c,E{x,x) = y^^i^ E ^x E E ^x E i^) 

X(modg) ^ X (mod g) 0<7<T 2 ' x(modq) 0<7<T 2 '/ 

L(l/2+i7,x)=0 L(l/2+i7,x)=0 



' X (mod g) 0<7<T 

L(l/2+i7,x)=0 



X,7 



2 hm > Re > — , 

X(modq) ^ 0<7<T \/ I + 7 

L(l/2+i7,x)=0 V 



where 9^^^ = C;^| | + ?7|/|c;^| + 27) is a complex number of modulus 1. The quantity e*'>'^°^^^^,^ 
is uniformly distributed (as a function of log x) on the unit circle as x tends to infinity, and hence 
its limiting logarithmic distribution is the same as the distribution of Z^. Since the various 7 in 
each inner sum are linearly independent over the rationals by LI, the tuple (e*''^'°s^6'^ ,y)o<7<r is 
uniformly distributed in the A'^(T', x) -dimensional torus by Kronecker's theorem. Therefore the 
limitinp- Inp-arithmir, Histrihiitinn nf the, sum 



limiting logarithmic distribution of the sum 

E 

3<7<- 
L(l/2+i7,x) 



0<7<T V i + 7^ 
■ ■ =0 » 
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is the same as the distribution of the random variable 

0<1<T \/ i + 7^ 
L(l/2+n,x)=0 V 

Finally, the work of Rubinstein and Sarnak [T4', Section 3.1] shows that the limiting logarithmic 
distribution of 

X V"— H> X V-"" 0<7<T \ J + T 

L(l/2+i7,x)=0 V 

is the same as the distribution of the random variable 



X (mod q) X (mod g) 0<7<T \/ 7 + 7 

L(l/2+i7,x)=0 V 



2 E E ^'^ 



X (mod q) 7>0 . ; _ -i- o/2 

L(l/2+i7,x)=0 



the convergence of this last limit being ensured by the fact that the are bounded and that each 
of the sums 



7>0 
L(l/2+i7,x)=0 

is finite. This establishes the lemma. □ 

We shall have further occasion to change the indexing of sums, between over all 7 and over only 
positive 7, in the same manner as in equations ( |2.1[ ) and ( |2.2[ ); henceforth we shall justify such 
changes "by the functional equation for Dirichlet L-functions" and omit the intermediate steps. 

Definition 2.4. For any relative prime integers q and a, define 

c(g; a) = — 1 + (mod q): = a (mod q)}. 



Note that c(g; a) takes only the values —1 and p(g) — 1. Now, with X^ as defined in Definition 2.2 
define the random variable 

Xg.a,b = ciq,b)-ciq,a) + 2 ^ |x(&)-x(a)| ^ ^ 



X (mod q) 7>0 A/I + 7^ 

L(l/2+i7,x)=0 V * 

Note that the expectation of the random variable Xg-a^ is either ±p{q) or 0, depending on the 
values of c(g, a) and c(g, 6). <) 

Definition 2.5. With n{x;q,a) = #{p < x: p prime, p = a(modg)} denoting the counting 
function of primes in the arithmetic progression a (mod q), define the normalized error term 

E{x; g, a) = {(j){q)n{x; g, a) - n{x)) . 

\JX 
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The next proposition characterizes the limiting logarithmic distribution of the difference of two 
of these normalized counting functions. 

Proposition 2.6. Assume GRH and LI. Let a and h be reduced residues modulo q. The limiting 
logarithmic distribution of E{x; q,a) — E{x;q,b) is the same as the distribution of the random 
variable Xg.a,b defined in Definition \2.4\ 



Remark. Since S{q; a, b) is defined to be the logarithmic density of those real numbers x for which 
n{x; q, a) > 7r(x; g, b), or equivalently for which E{x; q, a) > E{x; q, b), we see that 6{q; a, b) 
equals the probability that Xg.a,b is greater than 0. However, we never use this fact directly in the 
present paper, instead quoting from [|5l a consequence of that fact in equation ( |2.10[ ) below. 

Proof. As is customary, define 

n<x 

A consequence of the explicit formula for ip{x, x) that arises from the analytic proof of the prime 
number theorem for arithmetic progressions ( [[TT[ Corollary 12.11] combined with [illi (12.12)]) 
is that for x Xo, 

E ^1/2+^7 
1 , . + 0{\ogq ■ logx) 

L(l/2+i7,x)=0 

under the assumption of GRH. We also know [[T4l Lemma 2.1] that 

Eix-q,a) = -c{q,a)+ V x{a)^!^^ + oJ . (2.3) 

XT^XO 



Combining these last two equations with Definition |2.1| for E{x, x), we obtain 

E{x]q,a) = -c{q,a) - ^ x{a)E{x,x) + Oq(:^^j . 



X (mod g) 
XT^XO 

We therefore see that 

E{x;q,a) - E{x;q,b) = c{q,b) -c{q,a) + {x{b) - x{a))E{x,x) + Og(:^^j 

X(modg) \ Og^;/ 

(where we have added in the x = Xo term for convenience). The error term tends to zero as x grows 
and thus doesn't affect the limiting distribution, and the constant c(g, b) — c{q, a) is independent 
of X. Therefore, by Proposition 2.3 the limiting logarithmic distribution of E{x; q, a) — E(x; q, b) 
is the same as the distribution of the random variable 

c(g,6)-c(g,a) + 2 \m - x{a)\ E ^^=- 

X (mod q) 7>0 \/ i + 7^ 

L(l/2+i7,x)=0 V ^ 

Since |x(6) — = — this last expression is exactly the random variable Xq.a,b as 

claimed. □ 

To conclude this section, we calculate the variance of the random variable Xq-a,b- 

11 



Proposition 2.7. Assume LI. Let {cy^: x (mod g)} be a collection of complex numbers satisfying 
~ constant fi, the variance of the random variable 

/^ + 2 E E -]== (2.4) 

X (mod q) 7>0 \/ I + 7^ 

L(l/2+n,x)=0 V ^ 

equals X^xCmodg) kxl^^(^)' ^here b{x) defined in Definition L3 In particular, the variance of 
the random variable Xq-a,b defined in Definition 2.4 is equal to the quantity V{q; a, h) defined in 



Definition 1.3. 



Proof. The random variables {X^ : 7 > 0} form an independent collection by definition; it is 
important to note that no single variable can correspond to multiple characters due to the 
assumption of LI. The variance of the sum ( |2.4| ) is therefore simply the sum of the individual 
variances, that is, 

E 1^.1 E 7=)=^ E Ki' E P^- 

^ X (mod q) 7>0 \/ 1 + 7 X (mod q) 7>0 4 ^ ' 

L(l/2+i7,x)=0 V L(l/2+i7,x)=0 

The variance of any is \, and so this last expression equals 

2 E E 47 

X(modi}) 7>0 4 ' 

L(l/2+i7,x)=0 

= E l^xl^ E 1 I ^2 + E l^xl^ E 1 I ^2 

X(mod(j) 7>0 4 ^ ' x (mod g) 7<0 4 ^ ' 

L(l/2+j7,x)=0 L(l/2+j7,x)=0 

= E i-.r^ E E KP'(^) 

X (mod q) 7£R 4 i^ (mod q) 

L(l/2+j7,x)=0 

by the functional equation for Dirichlet L-functions. The fact that V{q] a, h) is the variance of 
Xq-^afi now follows dircctly from their definitions. □ 

2.2. Calculating the characteristic function. The characteristic function Xq.^a,b{z) of the random 
variable Xq-^a,b will be extremely important to our analysis of the density 5{q] a, h). To derive the 
formula for this characteristic function, we begin by setting down some relevant facts about the 
standard Bessel function Jq of order zero. Specifically, we collect in the following lemma some 
useful information about the power series coefficients A„ for 

oo 

logJo(z) = E^n^"' (2-5) 

n=0 

which is valid for |2;| < ^ since Jq has no zeros in a disk of radius slightly larger than y centered 
at the origin. 



Lemma 2.8. Let the coefficients A„ be defined in equation ( |2.5| ). Then: 

(a) A„ ^ (]^)" uniformly for n > 0; 

(b) Ao = and X2m-i = Ofor every m > 1; 

(c) A2m < 0/or every m > 1; 
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(d) A„ is a rational number for every n > 0. 

Proof. The fact that log Jo is analytic in a disk of radius slightly larger than y centered at the 
origin immediately implies part (a). Part (b) follows from the fact that Jq is an even function with 
Jo(0) = 1. Next, Jo has the product expansion yjl Section 15.41, equation (3)] 



k=l 

where the Zk are the positive zeros of Jq. Taking logarithms of both sides and expanding each 

12 
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summand in a power series (valid for \z\ < ^ as before) gives 



k=l ^ „=i ^ k=l 

which shows that A2n = —n^^ JZ'kLi -^/T^" negative, establishing part (c). Finally, the Bessel 
function Jo{z) = J2m=o (— |)'"2;^™/(m!)^ itself has a power series with rational coefficients, as 
does log(l + z); therefore the composition log(l + (Jo(-2) — 1)) also has rational coefficients, 
establishing part (d). □ 



Definition 2.9. Let A„ be defined in equation ( |2.5| ). For any distinct reduced residues a and 
b (mod q), define 

o2n|\ I 1 

' ^ X(modq) 7>0 ^ ' ' ' 

L(l/2+i7,x)=0 



where V{q] a, h) was defined in Definition 1.3 so that Wi{q] a,h) = \ for example. <C> 

In fact, Wn{q] a, h)V{q] a, h) is (up to a constant factor depending on n) the 2nth cumulant of 
a, 6), which explains why it will appear in the lower terms of the asymptotic formula. We 
have normalized by V{q] a, b) so that the Wn{q; a, b) depend upon q, a, and 6 in a bounded way: 

Proposition 2.10. We have Wn{q\a,b) ^ (y)^" uniformly for all integers q and all reduced 
residues a and b (mod q). 

Proof. From Definition |Z9 and Lemma [2^ a), we see that 

02n 1 



L(l/2+i7,x)=0 

' ^ X(modg) 7>0 ' ' 

L(l/2+i7,x)=0 

= (|)2V"-24"-l « (f 

as claimed. □ 
The following functions are necessary to write down the formula for the characteristic function 
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Definition 2.11. For any Dirichlet character define 

Fiz,x)= n ^0 

7>0 

La+i-y,x)=o 



2z 



Then define 

%;a,b{^)= n F{\x{a)-X{b)\z,x) 

X (mod q) 

for any reduced residues a and 6 (mod q). Note that x) | < 1 for all real numbers x, since the 
same is true of Jo . <) 

The quantity Wn{q; a, b) owes its existence to the following convenient expansion: 

Proposition 2.12. For any reduced residue classes a and b (mod q), 

^q;a,bi^) = exp ( -V {q; a, b) ^ W^{q] a, b)z^'^ \ 

^ m=l ^ 



for \z\ < In particular, 



^q;aA^) = c-''^'^-'^''^' (l + 0(l-(g; a, b)z')) 



fijr \z\ < mm{V{q;a,b) 



Proof. Taking logarithms of both sides of the definition of ^g-a,b{z) in Definition 2.1 1 yields 

'2\xia)-xib)\z' 



X (mod g) 7>0 

L(l/2+i7,x)=0 



Since |z| < ^, the argument of the logarithm of Jq is at most 2 ■ 2 ■ = ^, and so the power 
series expansion (|2.5|) converges absolutely, giving 



OO / 



X (mod q) 7>0 r),=0 

L(l/2+i7,x)=0 



2|x(a )-xWk A" 



By Lemma [2^ b) only the terms n = 2m with m > 1 survive, and by Lemma 2.8 'c) we may 
replace A2,n by — |A2m|- We thus obtain 



log$,;,,,(z) = -5^^2'"-|A2™|22- J] Ix(a)-X 

X (mod g) 



2m 



E 



1 



m=0 



7>o + 

L(l/2+i7,x)=0 



^V^(g;a,6)Wm(g;a,6); 



,2m 



m=l 



for |z| < ^, by Definition 2.9 for Wml?; a, This establishes the first assertion of the proposition. 
By Proposition [OO} we also have 

OO o 



10^2"^^2m ^ (10/3) ^ ^ ^4 



m=2 



m=2 



1 - 100^79 



(2.7) 
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uniformly for \z\ < |, say. Therefore by the first assertion of the proposition, 

^q;aAz) = exp {-V{q; a, b)Wi{q; a, b)z'^) exp ( -V{q; a, b) ^ W„r{q; a, b)z^"' j 



m=2 

,2 , 



as long as V{q] a, b)z^ < 1. This establishes the second assertion of the proposition. □ 

All the tools are now in place to calculate the characteristic function Xq.a,b{z) = E(^e^^^i'°--''y 
Proposition 2.13. For any reduced residue classes a and b (mod q), 

In particular, 

log Xq.a,b{z) = i (c(g, a) - c(g, b))z - ^V{q]a,b)z^ + O {V{q] a, b)z'^) 

for \z\ < I. 

Remark. The first assertion of the proposition was shown by Feuerverger and Martin [3] by a 
slightly different method. Unfortunately an i in the exponential factor of [5. equation (2-21)] is 
missing, an omission that is repeated in the statement of [5, Theorem 4]. 

Proof. For a random variable X, define the cumulant-generating function 

^x(t)=logX(t)=logE(e**^) 

to be the logarithm of the characteristic function of X. It is easy to see that gaxif) = gx{oLt) 
for any constant a. Moreover, if X and Y are independent random variables, then E(e**^e**^) = 
E(e**"^)E(e**^) and so gx+Y{t) = gx{t) + grit)- Note that if the random variable C is constant 
with value c, then gcit) = itc. 



We can also calculate gx (t) where X^ was defined in Definition 2.2 Indeed, if is a random 
variable uniformly distributed on the interval [— tt, tt], then Zy = e*® and thus X^ = cos 6, whence 

where Jq is the Bessel function of order zero [|T1 9.1.21]. 



From Definition 2.4 the above observations yield 



X (mod g) 7>0 

L(l/2+j7,x)=0 



in other words. 



logX,,,,,it)=tt{c{q,b)-c{q,a))+ E log Jo ( ^'^^^^-^ t] 

X(modg) 7>0 ^ + 1 / 

L(l/2+i7,x)=0 

= it{c{q, b) - c{q, a)) + log <l>q;a,b{x) (2.8) 
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according to Definition 2.1 1 Exponentiating both sides establishes the first assertion of the propo- 
sition. To establish the second assertion, we combine equation ( |2.8[ ) with Proposition 2.12 to see 
that for l-z] < |, 



logXq.a,b{t) = it{c{q, h) - c(g, a)) - V{q\ a, h) ^ Wm{q] a, h)'. 



,2m 



m=l 



it{c{q, b) - c{q, a)) - Wiq; a, b)z^ + 0{V{q; a, b)z^ 



by the estimate ( |2.7[ ) and the fact that Wi {q; a, b) 



1 

2' 



□ 



2.3. Bounds for the characteristic function. A formula (namely equation ( |2.10[ ) below) is known 
that relates S{q; a, b) to an integral involving Using this formula to obtain explicit estimates 

for 5{q] a, b) requires explicit estimates upon ^q;a,b\ our first estimate shows that this function takes 
its largest values near 0. 

Proposition 2.14. Let < k < ^. For any reduced residue classes a and b (mod q), we have 

\^q,a,b{t)\ < \'^q-aM\forallt> K. 

Proof. From Definition |2?TT it suffices to show that for any real number 7 > 0, 



Jo 



2\x{a)-xmt 
a/1/4 + 72 



< 



Jo 



2\x{a) - x{b)\^ 



v/l/4 + 7' 

for all t > K. We use the facts that Jo is a positive, decreasing function on the interval [0 
that Jo(i) > |-^o(a;)| for all x > |. Since 

2\x{a ) - x{h)\K 2 ■ 2 ■ 5/24 ^ 5 

3' 



(2.9) 



, |] and 



< 



v/i/4 + 7' 

we see that Jq is positive and decreasing on the interval 

''2\x{a)~x{h)\K 5" 



^174 + 



Together with Jo(|) > l-^o(3;)| for all x this establishes equation ( |2.9| ) and hence the lemma. 

□ 

Let A^(T, x) denote, as usual, the number of nontrivial zeros of L(s, x) having imaginary part at 
most T in absolute value. Since the function ^q-a,h is a product indexed by these nontrivial zeros, 
we need to establish the following explicit estimates for N(T, x)- Although exact values for the 
constants in the results of this section are not needed for proving Theorem |1.1[ they will become 
necessary in Section B] when we explicitly calculate values and bounds for 5{q; a, b). 



Proposition 2.15. Let the nonprincipal character x (mod q) be induced by x* (mod q*). For any 
real number T > 1, 



ForT > 100, 



T a*T Q*T 
N(T, x) < - log — + 0.68884 log - — + 10.6035. 
71 zne zTce 



44T Q*T 
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Proof. We cite the following result of McCurley [fTOl Theorem 2.1]: for T > 1 and r] E (0, 0.5], 

<Ci logq*T + C2, 



T Q*T 
iV(T, x)-- log ^ 



with Ci 



l+2r) 



iogC{i±5) _ 2 iogC(2+2,) ^ g iogC(j+2r,) _ (McCuriey 



, „ and C2 = .3058 - .268r7 + 4 , „ - , o . ,0 

TT log 2 ^ ' log 2 log 2 TT log 2 

states his result for primitive nonprincipal characters, but since L{s, x) and L(s, x*) have the same 
zeros inside the critical strip, the result holds for any nonprincipal character.) Taking 77 = 0.25, we 
obtain 



iV(T, x)-- log ^ 
TT 27re 



< 0.68884 log q*T + 8.64865 < 0.68884 log — + 10.6035. 

27re 



This inequality establishes the first assertion of the proposition. The inequality also implies that 



44T Q*T 
NiT, x) > — log ^ + 



457r 



27re 



T \. q*T 

- .68884 log 10.6035 



457r 



27re 



the second assertion of the proposition follows upon calculating that the expression in parentheses 
is at least —10.551 when T > 100 (we know that g* > 3 as there are no nonprincipal primitive 
characters modulo 1 or 2). □ 

The next two results establish an exponentially decreasing upper bound for ^q-a,bit) when t is 
large. 

Lemma 2.16. For any nonprincipal character x (mod q), we have \F{x, x)F{x, x)\ ^ e~°'^^^^^ 
for X > 200. 



Proof. First note that 

io Vv/l/4 + 7V 



n 



7>0 
L(l/2+n,x)=0 



7<0 
L(l/2+i7,x)=0 



2x 



Vl/4+(-7)^ 



by the identity L{s,x) = L{s,x), and therefore 

F{x,x)Fix,x)= n 



2x 



7eiR 

L(l/2+i7,x)=0 



+ T 



Using the bound [fT4l equation (4.5)] 



I Jo(2;)| < min <^ 1, 



we see that for x > 1, 

\F{x,x)Fix,x)\< n 

-a;/2<7<x/2 
L(l/2+i7,x)=0 



Jo 



2x 



vyT74 



+ r 



< 



n 



(1/4+72)1/4 



h\<x/2 
L{l/2+j7,x)=0 



TCX 



When X > I and I7I < x/2, the factor (1/4 + 7 



2N1/4, 



TTX] 



-1/2 



never exceeds 1/2. Therefore 



\Fix,x)Fix,x)\ < 2-^(^-/2'X) = exp (-(log 2)iV(x/2, x)) • 
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By Proposition |2.15[ we thus have for x > 200 



\ 457r Ane 
< exp 1^-0. 107866a; log ^ + 7.3183^ < e^^'^^^^^, 

as claimed. □ 

Proposition 2.17. For any distinct reduced residue classes a and b (mod q) such that {ab, q) = 1, 
we have \<^q;a,b{t)\ < e-o-0454<^(9)*/or t > 200. 

Proof. We begin by noting that the orthogonality relations for Dirichlet characters imply that 
X^xCmodg) Ixi^^) ~ = 20(g) (as we show in Proposition 3. l| below). On the other hand, 

if S is the set of characters x (mod q) such that |x(o) — x(&) | > 



then 



J2 \x{a) - xib)\' < Yl 1 + 5^4 = 0(g) -#^ + 4#5. 

X (mod q) x (mod q) x&S 

Combining these two inequalities shows that 20(g) < 0(g) + 3#S', or equivalently ^^S* > |0(g). 
Note that clearly Xo ^ S. 

From Definition |27TT] we have 

\%;a,b{t)\' = n \FMa)-x{b)\t,x)\'= n \FMa)-x{b)\t,x)FMa)-xm,x)l 

X (mod q) X (mod q) 

since every character appears once as x and once as x in the product on the right-hand side. Since 
\Fix,x)\ < 1 for all real numbers x, we can restrict the product on the right-hand side to those 



characters x G 5 and still have a valid upper bound. For any % G 5, Lemma 2.16 gives us 



\Fi\xia) - xib)\t,x)F{\xia) - xib)\t,x)\ < e-o-^^ssixW-xWI* < ^-0.2725* f^j. ^ > 2OO, whence 
\%;aAt)\' < n - xib)\t,x)FMa) - Xib)\t,x)\ < {e-'-''^'')*' < (e-o-O4540W*)2, 

which is equivalent to the assertion of the proposition. □ 

At this point we can establish the required formula for 5{q; a, b), in terms of a truncated integral 
involving ^q-a,b, with an explicit error term. To more easily record the explicit bounds for error 
terms, we employ a variant of the 0-notation: we write A = 0(B) if |y4| < B (as opposed to a 
constant times B) for all values of the parameters under consideration. 

Proposition 2.18. Assume GRH and LI. Let a and b be reduced residues (mod g) such that a is a 
nonsquare (mod g) and b is a square (mod g). IfV{q] a, b) > 531, then 

X. 1 1 fV{,;a,b)-^/' sinp(g)x^ , , ^ 
6{q- a,b) = - + — / ^^%;aA^) dx 



2 27r ./_y(g;a,b)-i/4 X 



/ „-9.O80((?) \ 

+ O 0.03506 , , + 63.67p(g)e-^(''''''^)'^'/2 

V m) J 
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Proof. Our starting point is the formula of Feuerverger and Martin ||51 equation (2.57)], which is 
valid under the assumptions of GRH and LI: 



6{q;a,b) 



1 1 

2 ~ 2^ 



sin((c(g,a) - c{q,b))x] 



X 



(2.10) 



In the case where a is a nonsquare modulo q and 6 is a square modulo q, the constant c(g, a) — c{q, h) 
equals —p{q), so that 

^ sinp(g)x 



S{q;a,b) 



1 1 
2^2^ 



X 



The part of the integral where x > 200 can be bounded using Proposition 2.17 



1 

2^ 



sin p{q)x 



'200 



X 



^g-a.bix) dx 



< 



AOOtt 



poo Q Qi 7p:o -9.080(g) 

/ g-O.O4540(g)x ^ ^Ui i06e 

'200 



The part where x < —200 is bounded by the same amount, and so 



d{q;a,b) = ^ + 

Z Ztt 



r200 



'-200 



sin p(q)x , ^ , ^/ 

^j^'^q-aAx) dx + Ol 0.03506 



-9.O80((?) 

0(g) 



(2.11) 



We now consider the part of the integral where V{q; a, b)~^^^ < x < 200. The hypothesis that 
l^(g;a,&) > 531 implies that V^(g; a, 6)"^/^ < ^, which allows us to make two simplifications. 
First, by Proposition |2. 14 we know that \^q-a,b{,x)\ < ^q-a,b{y{q] a, b)^^/^) for all x in the range 
under consideration. Second, by Proposition |2.12| we have 



^q;a,b{^) = exp ( ~V{q; a, b) ^ Wm{q; a, b)x'^"' j < e" 

^ m=l ^ 



V(g;a,6)a;2/2 



for all real numbers \x\ < ^, since Wi{q; a,b) = \ and all the Wm{q'i a, b) are nonnegative by 
Definition 



2.9 



Since ^ < jj), we see that \^q-a,b{,x)\ < e ^('J;«''')^''^/2 for all x in the range under 
consideration. Noting also that | sui{p{q)x) / x\ < p{q) for all real numbers x, we conclude that 



•200 



sin p{q)x 



y(g;a,6)-l/4 X 



^{x)dx 



^•200 



<p{q) / e-^^'-'^'^'^'^'/'da: < 200p(g)e- 



■y(g;a,6)l/2/2 



'y(g;a,b)-i/4 



The part of the integral where —200 < a; < — V"(g; a, b) is bounded by the same amount, and 



thus equation ( |2.1 1[ ) becomes 



fV(g;a,6)-i/4 



sin p(g)x 



^q;a,b{x) dx 

+ 0^^0.03506 



0(g) vr 



which establishes the proposition. 



□ 
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2.4. Derivation of the asymptotic series. In this section we give the proof of Theorem Our 
first step is to transform the conclusion of Proposition 2.18, which was phrased with a mind towards 
the explicit calculations in Section |5} into a form more convenient for our present purposes: 

Lemma 2.19. Assume GRH and LI. For any reduced residues a and b (mod q) such that a is a 
nonsquare (mod q) and b is a square (mod q), and for any fixed J > 0, 



2 2Tr ^V{q; a, b) J -v{q;a,by/'^ p{q)y / \/V{q]a,b) \^V{q;a,b)J 

+ Oj{V{q;a,b)-') 



Proof. We make the change of variables x = y/ ^/V{q; a, b) in Proposition 2.18 obtaining 

11 fyi'^-'^'^y^' sin {piq)y/^Viq;a,b)) ^ [ V \ dy 



2 2n 



-y(g;a,b)i/4 y/^V{q;a,b) \^V{q;a,b)J y/V{q;a,b) 

_/ p-5. 120(g) 

+ 0.06217^—— + 63.67p(g)e-^(^''^''')'^'/' 
the main terms of which are exactly what we want. The lemma then follows from the estimates 



for any fixed constant J: these estimates hold because V{q] a, b) ~ 20(g) log g by Proposition 3.6 



while the standard lower bound 0(g) ^ g/ log log g follows from equation ( |5.19| ). □ 



We will soon be expanding most of the integrand in Lemma 2.19 into a power series; the fol- 
lowing definition and lemma treat the integrals that so arise. 

Definition 2.20. For any nonnegative integer k, define {2k — 1) ! ! = {2k — 1) {2k — 3) ■ ■ ■ 3 • 1, where 
we make the convention that (—1)!! = 1. Also, for any nonnegative integer k and any positive real 
number B, define 

Mk{B)= r y^^e-y'/^dy. 

J-B 



Lemma 2.21. Let J and B be positive real numbers. For any nonnegative integer k, we have 
Mk{B) = {2k - l)!!v^ + 0fc,j(5-^). 

Proof. We proceed by induction on A;. In the case = 0, we have 

/B poo POO 

e-y'/^ dy= e-y'/^ dy-2 e"*^'/' dy 
B J-oo J B 

= V2^ + o(^j e-^y^^dy^ 

= V2^ + O (^|e-^'/2^ =V2^ + Oj{B-') 
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as required. On the other hand, for > 1 we can use integration by parts to obtain 



J-B -B J- 

= 0{B^^-^e-''''^) + {2k - l)Mk-i{B). 



B pB 

y2k-2^-y^/2^y 

-B J-B 



Since the error term B'^^ ^^/^ is indeed Ok,j{B the lemma follows from the inductive 
hypothesis for Mk-i{B) . □ 

The following familiar power series expansions can be truncated with reasonable error terms: 

Lemma 2.22. Let K be a nonnegative integer and C > 1 a real number. Uniformly for \z\ < C, 
we have the series expansions 



K 

sm z x-^ , . , „■ z 



2j 



, E(-i)'p^^oc..(i.i--). 

Proof The Taylor series for e^, valid for all complex numbers z, can be written as 

K ^ oo 



The function J2'jLo / {j + K + 1)\ converges for all complex numbers z and hence represents 
an entire function; in particular, it is continuous and hence bounded in the disc \z\ < C. This 
establishes the first assertion of the lemma, and the second assertion is proved in a similar fashion. 

□ 

Everything we need to prove Theorem |1.1| is now in place, once we give the definition of the 
constants Sg-a,b{^,j) that appear in its statement: 

Definition 2.23. For any reduced residues a and b (mod q), and any positive integers j < i, define 

^ ^' i2+2i3+-+iie+i=i-j k=2 ^' 

where the indices ^2, • • • , ie+i take all nonnegative integer values that satisfy the constraint ^2 + 
2z3 + ■■■ + = i - j. Note that Sg.a,b{0,0) = 1 always. Since Wk{q;a,b) < by 

Proposition 2.10 we see that Sg-a,b{(^,j) is bounded in absolute value by some (combinatorially 
complicated) function of i uniformly in q, a, and b (and uniformly in j as well, since there are only 
finitely many possibilities {0,1, . . . ,£} for j). <C> 

Proof of Theorem \L1\ To lighten the notation in this proof, we temporarily write p for p{q), 5 for 
S{q; a, b), V for V{q; a, b), and Wk for Wk{q] a, b). We also allow all 0-constants to depend on K. 
Since 5 is bounded, the theorem is trivially true when V is bounded, since the error term is at least 
as large as any other term in that case; therefore we may assume that V is sufficiently large. For 
later usage in this proof, we note that p <ti V^^'^, which follows amply from the bound p <tie <f 



mentioned in Definition 1 .2 and the asymptotic formula V ~ 2(f){q) log q proved in Proposition 3.6 
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We begin by noting that from Proposition |2. 12 

/ oo 



„2fc 



k=\ 



'kx'" + oiyx 



fc=l 

exp(-V^iyfex2M(l + 0(\/, 
^ fc=i ^ 



2(K+2) 



)) (2.12) 



uniformly for all \x\ < min(^, V ^/^), where the second equality follows from the upper bound 



given in Proposition 2.10 Inserting this formula into the expression for a, h) from Lemma 2.19 
applied with J = K + 2, gives 



5 



1 
2 



P 



sm.{py/yV) 



2TxyV py/yV 



exp 



k=l 



my 

yk- 



2k 



o 



2{K+2) 



dy 

+ o(K-^-2). 



This use of equation < |2.12[ ) is justified because the argument y/VV of $q;a,fe in the integral in 
is at most V^^'^/ \/V < t, by the assumption that V is sufficiently large. To simplify 



2.19 



Lemma 

the error term in the integral, we ignore all of the factors in the integrand (which are bounded by 1 
in absolute value) except for the k = 1 term, in which Wi = ^, to derive the upper bound 



sm{py/ VV) 



-yi/4 py/ 
Therefore 



V 



exp 



A'+l 

E 

k=l 



2k \ y2{K+2) 



Wk]/ 

yk-i I yK+i 



1 



6 



1 
2 



P 



sm 



ipy/W) 



27rvy j-yi/4 pyjyV 



exp 



E 

fe=i 



yk-l 



dy + 



P 



yK+?,/2 



(2.13) 



The integrand in equation ( |2.13| ) is the product oi K + 2 functions, namely K ^ \ exponential 
factors and a factor involving the function (sin z) / z. Our plan is to keep the first exponential 
function as it is and expand the other factors into their power series at the origin. Note that the 
argument of the A;th exponential factor is at most WkV^'^^'^ in absolute value, which is bounded 
(by a constant depending on K) for all /c > 2 by Proposition 2.10 Similarly, the argument of the 
function (sin 2)/^ is bounded by pV^/"^/ \/V <^ 1. Therefore the expansion of all of these factors, 
excepting the exponential factor corresponding to A; = 1, into their power series is legitimate in the 
range of integration. 

Specifically, we have the two identities 



E 



3=0 



(2j + l)! 



K 

E 

ifc=0 



1)''= (Wky 



2k \ «fe 



V 



exp 



exp 



py/yV 
Wuy^^ 



o 



(py) 



2iK+l) 



yK+i 



yk-l 
■ yk-l 



+ 



o 



Wky' 

yk~l 
^2k{K+l) 

~yK+r 



2k\K+l 
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where the error terms are justified by Lemma 2.22, in the last equality we have used Proposi- 
tion [2?T0] to ignore the contribution of the factor Wk to the error term (since the 0-constant may 



depend on K). From these identities, we deduce that 



K 

E 

j=0 



'^y ipy) 



2i 



(2j + l)! 



K+l 



K 



HE 

k=2 ^ik=0 

sm{py/y/V) 



py/VV 



+ 



,2k \ «fc 



X e 



K+l 

n 

k=2 
K+l 



exp 



sm{py/VV) 



yk-i 

VK+l 

■ yk-l 

yk-i 



o 



(WkT 



2k\K+l 



I n 



27^+2 



yK+i 



(The computation of the error term is simplified by the fact that all the main terms on the right-hand 
side are at most 1 in absolute value, so that we need only figure out the largest powers of y and p, 
and the smallest power of V , that can be obtained by the cross terms.) 
Substituting this identity into equation ( |2.13[ ) yields 



2 27rv^ 7-^1/4 



E 

i=o 



(2j + l)! 



K+l 



K 



+ 



HE- 

k=2 ^ik=0 
^2K+2 



2k \ *fe 



^(^+2)(^^1)^^-.V2^^^ + 



yk-l 
P 



dy 



yK+3/2 



2 2nVV 



K K 



K 



EE- E 

j=0 12=0 lK+l=0 

K+l 



2j 



{2j + iy.v^ 



X 



n 

k=2 



1 



-Wk 



Zfc! \V>^~^ 



2^ \ 

M,+2n+■■■+iK+l)^K^. {V'/') + O 



P 



,2K+3 



yK+3/2 



where M was defined in Definition 2.20 Invoking Lemma 2.21 and then collecting the summands 
according to the power £ = j + ii + 2i2 + ■ ■ ■ + Ki^+i of V in the denominator, we obtain 



A ^ I P 

2 ^/2^ 



K K 



K 



EE- E 

j=0 12=0 tK+l=0 



K+l 



i-iy p'' 

{2j + iyv^ f-i'fc'V^'-' 



X ((2(j + 2z2 + --- + {K + ly^+i) - 1)!! + 0(V-(^+i)))^ + 



K{l+K{K+l)/2) , 2,- 

P V- 1 ^ (-l)V"^ 

yi2-.^2j + iy 



E=0 



2 y/2^ 

x(2(£ + .2 + --- + zx+i)-l)!!)+o(^^) 



K K 

E- E 

n=0 tK+i=0 

i2+2i3 + -+KiK+l=i-j 



01 

K+l 

n 

k=2 



P 



,2K+3 



yK+3/2 



(2.14) 
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where we have subsumed the first error term into the second with the help of Proposition 2.10 

The proof of Theorem |1.1| is actually now complete, although it takes a moment to recognize 
it. For < i < K, the values of j that contribute to the sum are < j < i, since i — j 
must be a sum of nonnegative numbers due to the condition of summation of the inner sum. In 
particular, all possible values of j and the ik are represented in the sum, and the upper bound of K 
for these variables is unnecessary. We therefore see that the coefficient of p^^V^^ on the right-hand 
side of equation ( |2.14[ ) matches Definition 2.23 for Sg-a,b{(^, j)- On the other hand, for each of the 
finitely many larger values of i, the ith summand is bounded above by p^^V^^^^ times some 
constant depending only on K (again we have used Proposition |2.10 to bound the quantities Wk 
uniformly), which is smaller than the indicated error term once the leading factor p/ \/2ttV is taken 
into account. □ 



3. Analysis of the variance V{q; a, h) 



In this section we prove Theorems 1.4 and 1.7 as well as discussing related results to which our 



methods apply. We begin by establishing some arithmetic identities involving Dirichlet characters 

Using thes e id entities and a classical formula for 6(x), we 
in Section 



and their conductors in Section 3.1 



complete the proof of Theorem 



1.4 



3.2 



The linear combination of values 



L 



that defines M*{q; a, h) can be converted into an asymptotic formula involving the von Mangoldt 
A-function, as we show in Section 3.3 and in this way we establish Theorem |1. 7 [ 

Our analysis to this point has the interesting consequence that the densities a, h) can be 
evaluated extremely precisely using only arithmetic content, that is, arithmetic on rational num- 
bers (including multiplicative functions of integers) and logarithms of integers; we explain this 



consequence in Section 3.4 Next, we show in Section 3.5 that the limiting logarithmic distribu- 
tions of the differences E{x] q, a) — E(x; q, h) obey a central limit theorem as q tends to infinity. 



Finally, we explain in Section 3.6 how our analysis can be modified to apply to the race between 
the aggregate counting functions 7r(x; g, A^) = < x: p is a quadratic nonresidue (mod g)} 
and 7r(x; g, R) = < x: pis a quadratic residue (mod q)}. 

3.1. Arithmetic sums over characters. We begin by establishing some preliminary arithmetic 
identities that will be needed in later proofs. 

Proposition 3.1. Let a and b be distinct reduced residue classes (mod q). Then 

J2 \x{a)-xib)\' = 2<P{q), 

X (mod q) 

while for any reduced residue c ^ 1 (mod q) we have 

l^(^) - xib)W{c) = -0(g) {igicab-^) + Lg{cba-^)) , 

X (mod q) 



where tg is defined in Definition 1.5 



Proof. These sums are easy to evaluate using the orthogonality relation [fTTl Corollary 4.5] 

: 0(g)6,(m). (3.1) 



X (mod q) 



(f){q), if m = 1 (mod g) 
0, if m ^ 1 (mod g) 
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We have 

X (mod q) x (mod <?) 

= 5Z 2- 5^ X{ab-')- Yl x(&«-')= 20(g) + + 0, 

X (mod q) x (mod q) x (mod g) 

since a ^ b (mod g). Similarly, 

E Ix(a)-X(&)rx(c)= 5^ (2-x(a)xW-x(&)xMMc) 

X (mod q) X (mod g) 

= 2x(c)- J] X(cari)- Y Xicba-') 

X (mod g) X (mod g) x (mod q) 

= — (f){q) (ig(ca6^^) + Lg{cba^^)) . 

□ 

The results in the next two lemmas were discovered independently by Vorhauer (see lITTl Section 
9.1, problem 8]). 

Lemma 3.2. For any positive integer q, we have 

YHq/d)m = mY^^^ 

d\q p\q 

while for any proper divisor sofqwe have 

A{q/s) 



Y^ii/d)m = m-^^^^^^ 



Proof. For the first identity, we group together the contributions from the divisors d such that q/d 
is a power of a particular prime factor p of q. If \\ q, write q = mp^ , so that p \ m. We get a 
contribution to the sum only when d = mp''^^ for some 1 < k < r. Therefore 

r r 

YKq/d)m = 5^5^A(/)0(mp'^-'=) = 5^0(m)logp^0(/"'=). 

d\q P^\\q k=l P^Wq k=l 

Since X]a|6 '^('^) ~ ^ ^'^y positive integer b, the inner sum is exactly p^~^. Noting that 0(m) = 
0(g) since p f n, we obtain 

as claimed. 

We turn now to the second identity. If g/s has at least two distinct prime factors, then so will 
q/d for every divisor d of s, and hence all of the K{q/d) terms will be 0. Therefore the entire sum 
equals 0, which is consistent with the claimed identity as Rq{s) = as well in this case. Therefore 
we need only consider the case where g/s equals a prime power p*. 
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Again write q = mp^ with p\ m. Since s = q/p^ = mp^ ^, the only terms that contribute to the 
sum are d = mp^''^ for t < < r. By a similar calculation as before, 



Y^K{q/d)<P{d) = ^A(/)0(mp'' = (f){m) log pJ2(PiP" 

d\s k=t k=t 



p'^ Hp - 1 



0(g): 

</^lP' J P'^ ^IP- -LJ 

since g/s = p*. This establishes the second identity. □ 



Recall that x* denotes the primitive character that induces x and that q* denotes the conductor 

of X*- 

Proposition 3.3. For any positive integer q, 



logg* = 0(g)Mogg-^^^^^Y 



X (mod q) ^ p\q 

while ifa^l (mod q) is a reduced residue, 

/ Ni * ,/ X A(g/(g,a - 1)) 
J2 x(a)logg =-0(g) ^(^/(^^,_i)) - 

Proof. First we show that 

^ x(«)logg* = logg x(a)-l]A(gM (3.2) 

X (mod (j) X (mod q) d\q X (mod d) 

for any reduced residue a (mod q). Given a character x (mod g) and a divisor d of g, the character 
X is induced by a character (mod d) if and only if is a multiple of g*. Therefore 

5^A(g/d) 5^ x(a)= E x(a)$^A(g/rf). 

x(modd) x (mod g) d\q 

q-\d 

Making the change of variables c = q/d, this identity becomes 
5^A(g/rf) Yl X{a)= Y x{a)Y^(^) 

d\q x(modc() x (mod g) c|g/(j* 

= Y x(a)log|r = logg ^ - Y x{a)^ogq\ 

X (mod q) X (mod q) x (mod q) 



which verifies equation (3.2). 



If a = 1 (mod g), then equation ( |3.2[ ) becomes 
Y logg*=logg 5^ l-5^A(g/rf) 1 

X(modij) x(modg) d\q x (mod d) 



0(g) log g - V A(g/ d)(j){d) = (f){q) log g - (j){q) V 

Z — / Z — / p — 1 
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by Lemma 3.2 establishing the first assertion of the lemma. If on the other hand a ^ 1 (mod q), 
then applying the orthogonality relation p.l[ ) to equation p.2[ ) yields 

J2 x(«)logg* = O-^A(g/rf)0(rf)6rf(a) 

X (mod q) d\q 

*fr-i) a -1)) 



by Lemma 3.2 again, establishing the second assertion of the lemma. □ 

Finally we record a proposition that involves values of both primitive characters and characters 
induced by them. 

Proposition 3.4. Let pbe a prime and e a positive integer, and let r be a reduced residue (mod q). 
Ifp \ q, then 

X (mod q) 

On the other hand, ifp \ q then 

( \( *( e-, ( e-,\ fHl/P'^)^ ifrp^ = 1 (mod q/p"), 

X (mod q) k 

where u > 1 is the integer such that p'^ \\ q. 

Proof. The first assertion is trivial: if p \ q then x*{p'^) = x{p'^) for every character x (mod q). If 
p I q, then xip'^) = for every x, and so 

X{r){x*{pl-X{pl) = J2 X{r)x*{pl= J2 

X (mod q) X (mod q) x (mod q) 

since x(r) = x*(r) for every x (mod q) due to the hypothesis that (r, g) = 1. Also, we have 
X*{p'^) = for any character x such that p | g*, and so 

X (mod 5) X (mod q) x (mod 5 /p" ) 



since (p^, g/p"^) = 1. The second assertion now follows from the orthogonality relation (3.1 ). □ 



3.2. A formula for the variance. Recall that b{x) was defined in Definition 1.3, we record a 



classical formula for b{x) in the next lemma, after which we will be able to prove Theorem 1.4 



Lemma 3.5. Assume GRH. Let q > 3, and let x be any nonprincipal character modulo q. Then 

h{x) = log ^ - 70 - (1 + log 2 + 2 Re 



TT L[l,X*) 

Proof. Since the zeros of L{s,x) L{s,x*) on the line Rez = ^ are identical, it suffices to 
show that for any primitive character x modulo q, 

E r^ = logf-7o-(l + X-(-l))log2 + 2Re|^. 

L(l/2+i7,x)=0 
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There is a certain constant B{x) that appears in the Hadamard product formula for L{s, x)- One 
classical formula related to it lITTl equation (10.38)] is 



ReBix) 



y Re-. 

6^ p 



(3.3) 



peC 

0<Rep<l 
LiP,x)=0 



We can relate B{x) to Kx) under GRH by rewriting the previous equation as 



-2ReB(x) 



E 



Re 



Re 



(3.4) 



L(l/2+j7,x)=0 L(l/2+i7,X)=0 

On the other hand, Vorhauer showed in 2006 (see [VL equation (10.39)]) that 

B(,, = _l,ogi-:^(l,X) + | + l±|zi),eg2. 

Z TT L Z Z 

Taking real parts (which renders moot the difference between % and x) and comparing to equa- 
tion p.4[ ) establishes the lemma. □ 



Proof of Theorem \L4\ We begin by applying Lemma 3.5 to Definition 1.3 for \/(g;a, 6), which 
yields 

V{q-aM= E lx(«)-x(&)r(log^-7o-(l + x(-l))log2 + 2Re^}^* 



X (mod q) 



Yl Ix(a)-x(&)l'logg*-(7o + log27r) ^ \^(a) - xm' 

X (mod g) X (mod q) 

-log2 ^ |x(a)-xWI'x(-l) + 2M*(g;a,6), 



(3.5) 



X (mod g) 



recalling Definition 1.6 for M*{q; a, b). We are permitted to reinclude the principal character xo in 
the three sums on the right-hand side, since the coefficient \xoio.) — Xo(^)P always equals 0. 

The second and third terms on the right-hand side of equation p.5[ ) are easy to evaluate using 
Proposition |3.1[ we have 

-(7o + log27r) = -2(70 + log 27r)</>(g) 



(3.6) 



X (mod q) 



and 



-log2 Y \xia)-xib)Wi-l) = i\og2)^iq){L,i-ab-') + t,{-ba~')) 

X (mod q) 

= {2log2)<f){q)L,{-ab-'). (3.7) 
The first sum on the right-hand side of equation p.5[ ) can be evaluated using Proposition |3.3[ 

Y \x{a)-xib)\Hogq* = ^ - xiab'') - x{ba~')) log q* 

X (mod q) X (mod q) 



Aiq/{q,ab-'-l)) 
0(g/(g,a6-i-l)) 



+ 



A(g/(g,&a-^-l)) 
0(g/(g,6a-i-l))' 
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Since {q, mn) = (g, n) for any integer m that is relatively prime to q, we see that (g, ab ^ — 1 

(g, a — b) = {q,b — a) = (g, 6a"^ — 1), and therefore 

A(g/(g,a-6)) 

P 



E - xWPlogg* = 20(g) Mogg - ^ ^ 

X(modg) ^ p|g ^ 

Substituting the evaluations p.6| ), ( |3.7| ), and p.8| ) into equation p.5| ), we obtain 



- &)) 



(3.8) 



l^(g;a,6) = 20(g) logg 



logP ^ A(g/(g,a - 6)) 
I P-1 <Piq/{Q,a-b)) 

- 2(70 + log27r)0(g) + (21og2)0(g)6,(-ar^) + 2M*(g; a, b) 

= 20(g) (£(g) + /^^(a - 6) + 6,(-ari) log 2) + 2M*(g; a, 6), 

where £(g) and Kq{n) were defined in Definition 1.5 This establishes the theorem. 

Theorem T]4]has the following asymptotic formula as a corollary: 

Proposition 3.6. Assuming GRH, we have V{q] a, b) = 2(j){q) logg + 0{(f){q) log logg). 

Proof. First note that the function (logt) /(t — 1) is decreasing for t > 1. Consequently, A(g)/0(g) 
is bounded by log 2. Also, letting pj denote the jth prime, we see that 



□ 



p _ 1 - 



log Pj 



- < logp^(g) < loga;(g) < log logg, 



where the final inequality uses the trivial bound ui{q) < (logg) /(log 2). From Definition 1.5 
we conclude that C{q) = logg + O(loglogg). Next, Kq{a — b) is bounded by log 2 as above, 
and Lq{ab^^) is of course bounded as well. Finally, on GRH we know that L '{\, x*)/L(l, %*) ^ 
log logg* < log logg (either see (HI, or take y = log^ g in Proposition 3.10), which immediately 
implies that M*{q; a, b) <^ (f){q) log logg by Definition 1.6 The proposition now follows from 
Theorem O □ 

3.3. Evaluation of the analytic term M*(g; a, b). The goal of this section is a proof of Theo- 
rem|1.7[ We start by examining more closely, in the next two lemmas, the relationship between the 



quantities M*{q; a, b) and M(g; a, b) defined in Definition 1.6 Recall that e(g; p, r) was defined in 
Definition [LSI 

Lemma 3.7. Ifp^ \\ q, then V — = , w ^ -eia ^iw 

rp'^=l (mod q/p") 

Proof. If r is not in the multiplicative subgroup (mod q/p") generated by p, then the left-hand side 
is clearly zero, while the right-hand side is zero by the convention that e(g; p, r) = oo in this case. 
Otherwise, the positive integers e for which rp^ = 1 (mod q/p") are precisely the ones of the form 
e(g; p, r) + ke{q] p, 1) for > 0, since e(g; p, r) is the first such integer and e(g; p, 1) is the order 
of p (mod q/p^). Therefore we obtain the geometric series 



E 



rp 



e>l 
El (mod g/p") 



pe 



fc=0 



pe(q\p,r)+ke{q;p,l) pe(g;p,r)^]^ _ p-e(g;p,l)^ 



as claimed. 



□ 
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Definition 3 .S. If p'^ \\ g, define 

1 logJ9 



hoiq;p,r) 



and 

Ho{q; a,b) = ^ {ho{q; p, ah~^) + /io(g; p, ha'^) - 2/io(g; p, 1)) • 



We will see later in this section, in the proof of Theorem 1 .7 that and are very close to 
the functions h and H also defined in Definition 1.8 Notice that if q is prime, then /io(?; q-, r) = 
(log q) I q{q — 1) independent of r and thus H{q; a,b) = for any a and 6. <) 

The next lemma could be proved under a hypothesis much weaker than GRH, but this is irrele- 
vant to our present purposes. 

Lemma 3.9. Assume GRH. If a and b are reduced residues (mod q), then 

M*{q- a, h) = M{q- a, h) + 4>{q)Ho{q] a, b), 



where M*{q; a, b) and M{q; a, b) are defined in Definition 1.6 
Proof. We begin with the identity 

This identity follows from the fact that the Euler product of L(s, x) converges uniformly for 
Re(s) > 1/2 + e; this is implied by the estimate ^p^^xip) a;^''^ log^ x which itself is a 
consequence of GRH. 
Therefore 

M*iq;a,b)-M{q;a,b)= \^^^^ ' ^^^^l^J^^ ' J^^) 



X (mod q) 
Xt^XO 



y-^oo ' ' p'^ 

X (mod q) p<y e=l 

Xt^Xo 



oo ^ 

lim^logpj^- J2 {x{ab-') + x{ba-')-2){x*{p')-x{pn): 



p<y e=l X (mod q) 



where the inserted term involving xo is always zero. Proposition 3.4 tells us that the inner sum 
vanishes except possibly when the prime p divides q; invoking that proposition three times, we see 
that 



M* (g; a, b) - M(g; a, 6) = J] ct>{q/p'') \ogp 



X 



E -+ E --2 E - 

e>l ^ e>l ^ e>l ^ 



aft- V=l (mod g/p") 6a-ip'==l (mod g/p"") p==l (mod g/p"^) 
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We can evaluate these inner sums using Lemma 3.7 by comparison with Definition 3.8 

logp / 1 



M*{q;a,b)-M{q;a,b)=(j){q)J2 



1 „ 1 

+ 



= <P{q)Ho{q;a,b), 

which establishes the lemma. □ 

We will need the following three propositions, with explicit constants given, when we undertake 
our calculations and estimations of 5{q] a, h). Because the need for explicit constants makes their 



derivations rather lengthy, we will defer the proofs of the first two propositions until Section 5.2 
and derive only the third one in this section. 

Proposition 3.10. Assume GRH. Let x^e a nonprincipal character (mod q). For any positive real 
number y, 

L'{l,x) ^ ^ x{n)A{n) -/ 14.27 log g + 16.25 16.1 log g + 17.83 \ 

L{i,x) ^ ^ ' V y'^' y'/' J' 

Proposition 3.11. Ifl<a<q, then 

y AW^_„/,2 ^ A(a) ^ ^ plog^g ^ 3.9351ogg 
n a \ q q 

n=a (mod q) 

Assuming these propositions for the moment, we can derive the following explicit estimate for 
M* (g; a, b), after which we will be able to finish the proof of Theorem |1. 7 [ 

Proposition 3.12. Assume GRH. For any pair a, b of distinct reduced residues modulo q, let ri and 
r2 denote the least positive residues ofab^^ and ba^^ (mod q). Then for q > 150, 

M*( M ^( ^/^^(^i) ^ ^(^2) ^ ^ . ,A ^^/23.6190(g)log2g 
M {q; a, b) = (f){q) \ h Ho{q; a,b) +0' 



ri r2 / \ Q 

" we 



Proof. The bulk of the proof is devoted to understanding M(g; a, b). From Proposition 3.10 
have 



X (mod q) n=l 



■l4.271ogg + 10.6 ^ 16.11ogg + 13.1 



yl/2 y3/4 



J2^^~n/y ^ {x{ba-'n) + x{ab-'n)-2x{n)) 



n 

n=l X (mod 



, 14.271ogg + 16.25 , 16.1 log g + 17.83 , , 
+ ( ^17^ + ) ) , (3.9) 
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and using the orthogonality relations in Proposition [3]T} we see that 

M(g;a,6)=0(g)f V ^6""/^+ V ^e^/y -2 V Me-/.") 
\ ^-^ n ^-^ n ^-^ n J 

n=ab~^ (mod q) n=6a~i (mod q) n=l (mod q) 

At this point we choosey = g^. We calculate that (14. 2 7 log g + 16.25)/g+(16.1 logg + 17.83)/g^/2 < 
3.816(log^ g)/g for g > 150, and so 

M(g;a,6)=0(g)f V Me-V.^+ V ^e-«/^^-2 V Me^/.^) 
\ ^-^ n ^-^ n ^-^ n I 

n=ab-^ (mod q) n=ba-^ (mod q) n=l(mod<5r) 

■15.2630(g) log^g 



+ 



Let ri and r2 denote the least positive residues of ab ^ and ba ^ (mod g). Using Proposition 3.11 
three times, we see that 

M(g; a, b) ^ 0(g) + ^ - 2^ + ^(K^ + ^ ^ ^^15.2630(g) log- g^ 

= 0(g) f ^ + ^ ^/^ 36.6190(g) log- g 



log-g 



ri ^2 / V 1 

for g > 150. With this understanding of M(g; a, 6), the proposition now follows for M*(g; a, 6) by 
Lemma im □ 

Proof of Theorem \T77\ Since Proposition 3.12 tells us that 

all we need to do to prove the theorem is to show that 

Ho{q; a, b) = H{q; a,b) + O 

\ 1 

The key observation is that p^^i'P''^^ = 1 (mod q/p'^) and > > 1, and so ^^('J'P'I) > 

q/p'^. Therefore by Definitions 1.8 and 3.8[ we have hQ{q]p,r) = h{q;p,r){l + 0{p'^/q)) and 
h{q;p,l) <^ i}ogp) / (j){p^){q/ p^) <ti (logp)/g. We see that 

Ho{q] a,b) = ^ {ho{q; p, ab~^) + /io(g; p, ha"^) - 2ho{q; p, 1)) 

p'lk 

= E ( + h{q;p, ba-')) (1 + 0(f )) + O(^) 

It is certainly true that h{q;p,r) <^ {\og p)/(f){p'^) <^ {\ogp)/p'^, and so the previous equation 
becomes 

logg' 



H,{q; a, b) = H{q; a, 6) + O + ^ = i/(g; a, 6) + O 



■pile 
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which establishes the theorem. 



□ 



3.4. Estimates in terms of arithmetic information only. The purpose of this section is to show 
that the densities 6{q; a, b) can be calculated extremely precisely using only "arithmetic informa- 
tion". For the purposes of this section, "arithmetic information" means finite expressions com- 
posed of elementary arithmetic operations involving only integers, logarithms of integers, values 
of the Riemann zeta function at positive integers, and the constants tt and 70. (In fact, all of these 
quantities themselves can in principal be calculated arbitrarily precisely using only elementary 
arithmetic operations on integers.) The point is that "arithmetic information" excludes integrals 
and such quantities as Dirichlet characters and L-functions, Bessel functions, and trigonometric 
functions. The formula we can derive, with only arithmetic information in the main term, has an 
error term of the form OAiq^"^) for any constant A > we care to specify in advance. 



To begin, we note that letting y tend to infinity in equation ( |3.9[ ) leads to the heuristic statement 



M{q-a,b) = f2^ Yl {x{ba~'n) + x{ab-'n)-2xin)) 

n= 1 X (mod q) 

00 A / \ 



n 

n=l 



A(ra) A(ra) A(n) 



n=ab-'^ (mod q) n=ba-'^ (mod q) n=l (mod q) 

where the "=" warns that the sums on the right-hand side do not individually converge. In fact, 
using a different approach based on the explicit formula, one can obtain 

\ ^-^ n ^-^ n ^ — ^ n 

l<n<j/ l^fi^y 
n=ah~^ (mod q) n=ba-^ (mod q) n=l (mod q) 

^^(m^y (3.10) 



In light of Theorem 1 1.4| in conjunction with Lemma [X9| we see that we can get an arbitrarily good 
approximation to V{q;a,b) using only arithmetic information. 

By Theorem 1 1.1[ we see we can thus obtain an extremely precise approximation for 5{q; a, b) as 



long as we can calculate the coefficients Sq-a,b{'^, j) defined in Definition 2.23 Inspecting that defi- 
nition reveals that it suffices to be able to calculate Wmiq', a, b) (or equivalently Wm{q] a, b)V{q] a, b)) 
arbitrarily precisely using only arithmetic content. With the next several lemmas, we describe how 
such a calculation can be made. 

Lemma 3.13. Let n be a positive integer, and set i = [|J. There exist rational numbers C„ 1, . . . , 
Cn,e such that 

2Re TT^:^ :— + ^ . + . +••• + 



(l/4 + t2)n V(l/2-it)"/ (l/4-Ft2)n-l (l/4 + t2)n-2 (1/4 + t^)"" 

for any complex number t. 
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Proof. Since 



op^^ 1 _ 1 I 1 _ {1/2 + itr + {1/2- ity 



(1/2 -it)" (1/2 -it)" (1/2 + it)" (l/4 + t2)n 

it suffices to show that 

(1/2 + itT + (1/2 - 2t)" ^ a,o -C„,£ 

(1/4 + t^Y (1/4 + t2)n + (1/4 + t2)n-l + " " " + (^^4 ^ ^2)n-£ ' <^ " 

where each Cn,m is a rational number and C„ o = 1- In fact, we need only show that this identity 
holds for some rational number Cnfi, since multiplying both sides by (1/4 + t^)" and taking the 
limit as t tends to i/2 proves that C„_o must equal 1. 
Using the binomial theorem, 

n 

(1/2 + itr + (1/2 - It)- = il) iir'm' + {-^t)') 

= E(;)(a""^(2(-i)^*^') 

3=0 

=2E(s)(irViv(a+*^)-a' 

j=0 

j=0 m=0 

which is a linear combination of the expressions (1/4 + t'^)^, for < m < i, with rational 



coefficients not depending on t. Dividing both sides by (1/4 + t )" establishes equation p.l 1[ ) for 



suitable rational numbers Cn,m and hence the lemma. □ 

For the rest of this section, we say that a quantity is a fixed Q-linear combination of certain 
elements if the coefficients of this linear combination are rational numbers that are independent 
of q, a, b and x (but may depend on n and j where appropriate). Our methods allow the exact 
calculation of these rational coefficients, but the point of this section would be obscured by the 
bookkeeping required to record them. 

Definition 3.14. As usual, T{z) denotes Euler's Gamma function. For any positive integer n and 
any Dirichlet character x (mod q), define 

bn{x) = Y n , 2\n' 

L(l/2+i7,x)=0 

so that bi{x) = b{x) for example. <) 
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Lemma 3.15. Assume GRH. Let nbe a positive integer, and let xbe a primitive character (mod q). 
Then bn{x) a fixed Q-linear combination of the quantities 



\ log-, 


-logrw 







^=(l+€)/2 



logr(s) 



Re 



d 
ds 



^og L{s,x) 



s=(l+0/2 

Re 



s=l 



^og L{s,x) 



(3.12) 



where ^ = ifx{-l) = I and ^ = 1 ifx{-l) = -1- 



Remark. Since the critical zeros of L{s, x) and L(s, x*) are identical, the lemma holds for any 
nonprincipal character x ifj in the set p.l2| ), we replace q by q* and L(s, x) by x*)- 

Proof. For primitive characters x, Lemma 33]tells us that 



Kx) = log 



TT 

log - + 

TT 



7o-(l + x(-l))log2 + 2Re 



L'{i,x) 
Hi,x) 
L'H.x) 



(1+0/2 -^U^Xj 



which establishes the lemma for n = 1. We proceed by induction on n. By Lemma 3.13 
that 



we see 



Ux) 



E 



1 



7eM 

L(l/2+i7,x)=0 



= E 

L(l/2+i7,x)=0 



(1/4 + 7 



2 Re 



2\n 



+ 



CnA 



+ 



a 



n,2 



(1/2-^7)" (1/4 + 72)"-! 
Cn,lbn-1 (X) + • • • + a/&n-^(x) + 2 5^ Re 



7eR 

L(l/2+i7,x)=0 



(l/4 + 72)«-2 

1 

(1/2 -^7)^^ 



+ ■■■ + 



n,e 



(1/4 + 72)"-^ 

(3.13) 



(where i = [fj). By the induction hypothesis, each term of the form Cn,mbn-m{x) is a fixed 
Q-linear combination of the elements of the set p.l2[ ); therefore all that remains is to show that 
the sum on the right-hand side of equation p.l3[ ) is also a fixed Q-linear combination of these 
elements. 

Consider the known formula [fTTl equation (10.37)] 



-^^og L{s,x) = B{x) - ^logT 



1, q 

Z vr 



E 



1 1 

+ - 

- P P 



where J2p denotes a sum over all nontrivial zeros of L(s, x) and -B(x) is a constant (alluded to in 
the proof of Lemma [33]). If we differentiate this formula n — 1 times with respect to s, we obtain 



(is" 



logI^(s,x) 



(is" 



logr 



-1 



\n-l, 



n - 1)! 



(s - pY 
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Setting s = 1 and taking real parts, and using GRH, we conclude that 



(1/2 -i-fY 

7GIR ^ ' p 

L(l/2+j7,x)=0 



^^Re_,ogL(,.,) + _logr 



which is a fixed Q-linear combination of the elements of the set p.l2[ ) as desired. (Although 
logr(s)]^_^^^^^^2 ^iid logr(^)]^__^ differ by a factor of 2", this does not invalidate the 
conclusion.) □ 

The following three definitions, which generalize earlier notation, will be important in our anal- 
ysis of the higher-order terms Wniq; a, b)V{q; a, b). 

Definition 3.16. For any positive integers q and n, define 

-.(.)-5:(-lr(„:.)(.(«■-•)('o.^E3 



|j|<n 



^ <Piq/iq,a^ -¥)))■ 



Definition 3.17. Let x be a Dirichlet character (mod q), and let a and b be integers. For any positive 
integers j < n, define 



2n 



X (mod q) 
Xt^XO 



and 



2n 



X (mod q) 

Xt^XO 



■ ^og L{s,x* 



s=l 



log^(s,x) 



s=l 



SO that A^* ^^(g; a, 6) = M*{q;a,b)/(f){q) and A^^i(g; a, 6) = M{q; a,b)/(f){q) for example. One 
can use Lemma [3. 191 and Perron's formula to show that 

2j \ A(n) log-'"^ n ^ ^ flog'^^qy^ 

J + h 



\i\<j 



o, 



n<y 
n=a^b~'' (mod q) 



n 



in analogy with equation p.lO[ ). 

Definition 3.18. For any distinct reduced residue classes a and b (mod q), define 



<> 



(hip" , 
pik ^ ' \i\<j 
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2n 



E 



e>l 

a'6-V=l (mod g/p'') 



for any integers 1 < j < n. Notice that the inner sum is 



e>l ^ e>l 

a*fe~'p'^ = l (mod g/p") e=e{q;p,a''b~') (mod e{q; p, 1)) 



where e(g; p, r) is defined in Definition 1.8 It turns out that the identity 



e=r (mod s) 

(in which denotes the Stirling number of the second kind) is valid for any positive integers m, 
p, r, and s such that r < s (as one can see by expanding (sk + r)™ by the binomial theorem and 
then invoking the identity [6, (equation 7.46)]). Consequently, we see that (g; a, b) is a rational 
linear combination of the elements of the set {(logp)-' ■ p \ q} (although the rational coefficients 
depend upon q, a, and b). <0> 

Once we determine how to expand the coefficient — a linear combination 



of individual values of we can establish Proposition 3.20 which describes how the cumulant 



Wn{q; a, b)V{q; a, b) can be evaluated in terms of the arithmetic information already defined. 

Lemma 3.19. Let xbe a Dirichlet character (mod q), and let a and b be reduced residues (mod q). 
For any nonnegative integer n, we have 



\x{a)-x{b)r=Y.^-^)i 



Proof. The algebraic identity 



|j|<n 

can be verified by a straightforward induction on n. Since 

|x(a) - xib)? = (x(a) - x{b)) (x(a) - x{b)) = 2 - xiab'') - x{ab-')-\ 

the lemma follows immediately. □ 

Proposition 3.20. Assume GRH. Let a and b be reduced residues (mod q). For any positive integer 
n, the expression Wn{q;a,b)V{q;a,b)/(f){q) can be written as a fixed Q-linear combination of 
elements in the set 

{Cn{q)} U {L,{a'b-') log2, i,{-a'b-') log 2, ..(a^r^To: \^\ < n} 

U {L,{a'b-')aj), Lq{-a'b~%{j): \z\ < n, 2 < j < n] 

u{Hr,,jiq;a,b), Mn,Ma,b): l<j<n}. (3.14) 
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Proof. From the definitions ( |2.9[ ) and p.l4| ) of W„(g; a, h) and &n(x), we have 



Wn{q]a,h)V{q-a,h) 22«|A2„| 



0(g) 



0(g) 



E \x{a)-xih)r E 



1 



X (mod q) 



7>0 
L(l/2+i7,x)=0 



(1/4 + 72)^ 



X (mod ij) 



Lemma 2.8 ^d) tells us that the numbers \2n are rational. Therefore by Lemma 3.15 it suffices 
to establish that three types of expressions, corresponding to the three types of quantities in the 
set p.l2| ), are fixed Q-linear combinations of elements of the set p.l4[ ). 

Typel:^ ^ |x(a) - x(&)r" log ^* 



0(g) 



X (mod q) 



Note that Proposition 3.3 can be rewritten in the form 

Y\ x(a)logg* = 6q(a 



0(g) 



X (mod q) 



.[ . v^logp\ X A(g/(g,a- 1)) 
)yogq~2^j—^j-[l-Lq{a))^y-^— (3.15) 



By Lemma 3.19 and the orthogonality relation (3.1 1, we have 



— y 

0(g) ^ 

x(modi3) 



|i|<n 



2n 

n + i 



0(g/(g,a - 1)) ' 



(3.16) 



Therefore, using equation (3.15 1 and Proposition 3.1 we get 



1 



0(g) 



J2 \x{a)-x{b)r\og 



X (mod q) 



2ni„„ g^ 

vr 



^ E W«)-x(^)l'"log9*-^ E lx(«)-x(&)P"logvr 



0(g) 



X (mod q) 

2n 

n + i 



X (mod q) 

p\q ^ 



\i\<n 

- (1 - ^.(«^^"0)^M^S^ - ^.(«^^"^) l°g^ ) = ^n(^)' 



0(g/(g,a*&-'-i)) 



since (g, a^b * — 1) = (g, a* — 6*). 
Type E lx(a)-x(&)r" 



X (mod q) 



logr(s) 



for some 1 < j <n. 



s={l+i)/2 



The following identities hold for j > 2 (see LL equations 6.4.2 and 6.4.4]): 



ds^ 
d^ 
ds^ 



-AogV{s 
logr(s) 



s=l/2 



-i)^(j-i)!C(j); 



[-1)^(J-1)!C(J)(2^-1) 
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Because ^ = when = 1 ^^id ^ = 1 when x(— 1) = —1, we may thus write 



logr(s) 



J s=(l+S)/2 



(-l)^(j-l)!C(j)(2^'"' + x(-l)(2^'"'-l)), 



whence by equation p.l6| ), 



X (mod q) 



dsj 



Tiogr(s) 



s={l+5)/2 



^ E lx(«)-x(&)r"(-iy(j-l)!C(j)(2^-' + x(-l)(2^-'-l)) 



X (mod g) 



2n 

n + i 



-iy(j-l)!C(j)(2^-^E(-l)^( 

|i|<n ^ 



\i\<n 

which is a linear combination of the desired type. The case j = I can be handled similarly using 
the identity 

logr(s) =-7o- (l + x(-l))log2. 

J s=(l+0/2 



ds 



Type 3:^ |x(a) - xWP" Re 



X (mod ( 



logL(s,x* 



for some 1 < j < n. 



s=l 



The expression in question is exactly M.'^j{q; a, h), and so it suffices to show that A^* ^ (g; a, h) 
M.n,j{q] a, h) + Hnj{q; a, b). Note that the identity 



d^ 



d^ 

-— log L(s,x) , . 
ds^ ^ ^ dsi- 



-f-E 



A(n)x(ri) 



ra=l 



. A(n)(logn)^ V(n 



implies 



d^ 
ds^ 



n=l 



J s=l 



-iyY.^\ogpyY.—^{f)- 

p e=l ^ 



The proof of Lemma 3.9 can then be adapted to obtain the equation 



A^;^.(g; a, b) - MuAt, «, b) = f $^(logp)^' 



oo 



\2n 



p 

e=l X (mod q) 

oo 



2n 



n + i 



E X{a'b-%*{p' 

X (mod q) 



by Lemma 3.19 Evaluating the inner sum by Proposition 3.4 shows that this last expression is 
precisely the definition of Hnj{q; a, b), as desired. □ 



As described at the beginning of this section, Proposition 3.14 is exactly what we need to justify 
the assertion that we can calculate 5{q; a, b), using only arithmetic information, to within an error 
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of the form Oyi(g~^). That some small primes in arithmetic progressions (mod q) enter the calcu- 
lations is not surprising; interestingly, though, the arithmetic progressions involved are the residue 
classes a-'b^^ for |j| < n, rather than the residue classes a and b themselves! 

To give a better flavor of the form these approximations take, we end this section by explicitly 
giving such a formula with an error term better than 0{q^^^'^^'^) for any e > 0. Taking K = 1 in 
Theorem 1 1 . 1 1 gives the formula 



6{q]a,b) 



1 



1 



SW2{q]a,b) 



2 y/27rV{q;a,b) \ 6V{q;a,b) V{q;a,b) 
Going through the above proofs, one can laboriously work out that 

W2{q;a,b)V{q;a,b) 



+ 



p{<iY 



V{q-a,bf/^ 



4^) E |x(«)-x(&)r{(7o + log2+iC(2))+x(-l)(log2 + K(2))} 

X (mod q) 



to which Lemma 3.19 can be applied with n = 2. Combining these two expressions and expanding 
V{q] a, b) as described after equation ( |3.10[ ) results in the following formula: 

Proposition 3.21. Assume GRH and LI. Suppose a and b are reduced residues (mod q) such that 
a is a nonsquare and b is a square (mod q). Then 

p{qf 



5{q;a,b) 



1 



+ 



pio) 

2A/7r0(g)(£(g; a,b) + n{q;a,b)) 
3 



1 



+ 



120(g)£(g;a,6) 

^rf ,,2,A(g)-(6 + 26,(a^r2))(7o + log2 + iC(2)) 
lQ(j){q)C{q;a,by ^ 

- (2i^(-a%-')-8i,{-ab-')){log2+la2)) 
+ 2J'i{q;a,b) + 2H2,i{q; a,b) - J^2{q;a,b) - if2,2(g; a, 6)| 



ogg 



where C2{q) is defined in Definition 3.16 and H2j{q; a, b) is defined in Definition 3.18 and 



£{q; a, b) = C{q) + Kg{a - b) + Lq{-ab-^) log 2 + Ho{q; a, b) + 



7^(g;a,6) 

J'i{q;a,b) 
J^2{q;a',b) 



A(n) 



+ 



q<n<q'^ 
n=ab~^ (mod q) 



n 



E 

'<n<g 
fo-l (m 

A(a26-2) 

a26-2 



E 



q<n<q'^ 
n=ba~^ (mod < 



n 



E 



A(a6-i) A{ba-^) 

ab~^ ba~^ 
A{n) 



q<n<q'^ 
n=l (mod q) 



n 



^ A{ab-^) ^A{ba-^) , A(62a-2) 



6a" 



62a-2 



_ ^A{ab-^) log(a6-i) _ ^A{ba-^) logiba 



ab-^ 
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ba~ 



^ ^ A(62^-2) log(52^- 



62a- 



In all these definitions, expressions such as a% ^ refer to the smallest positive integer congruent 
to a^6~^ (mod q). 



3.5. A central limit theorem. In this section we prove a central limit theorem for the functions 

E{x; q, a) — E{x; q, b) = (j){q){'n-{x; g, a) — 7r(x; q, b))x~^^'^ \ogx. 

The technique we use is certainly not without precedent. Hooley [7J and Rubinstein and Sarnak 
lfT4ll both prove central limit theorems for similar normalized error terms under the same hypothe- 
ses GRH and LI (though each with different acronyms). 

Theorem 3.22. Assume GRH and LI. As q tends to infinity, the limiting logarithmic distributions 
of the functions 

E{x;q,a) ~E{x;q,b) 
^/20(g)logg 

converge in measure to the standard normal distribution of mean and variance 1, uniformly for 
all pairs a, b of distinct reduced residues modulo q. 

We remark that this result can in fact be derived from Rubinstein and Sarnak's 2-dimensional 
central limit theorem [ jTil Section 3.2] for {E{x] g, a), E{x] g, 6)), although this implication is not 
made explicit in their paper. In general, let 

/•oo /"OO 

0x,y(s,t)= / / exp {i{sx + ty))fx,Y{x,y)dxdy 
Jo Jo 

denote the joint characteristic function of a pair {X, Y) of real-valued random variables, where 
/x,y(x, y) is the joint density function of the pair. Then the characteristic function of the real- 
valued random variable X — Y is, 

<f)x-Y{t)=E{exp{tt{X-Y))) 

exp{it{x - y))fx,Y{x, y) dx dy = (t>x,Y(t, -t). 



'0 JO 

The derivation of Theorem 13 . 22 1 from Rubinstein and Sarnak's 2-dimensional central limit theorem 
then follows by taking X and Y to be the random variables having the same limiting distributions 
as E{x; q, a) and E{x; q, b), respectively (which implies that X — Y = Xq.a,b)- 

On the other hand, we note that our analysis of the variances of these distributions has the benefit 
of providing a better quantitative statement of the convergence of our limiting distributions to the 



Gaussian distribution: see equation p.l9[ ) below 



Proof of Theorem \3. 22 Since the Fourier transform of the limiting logarithmic distribution of 
E{x;q,a) — E{x]q, b) is Xq.a,b{v)^ the Fourier transform of the limiting logarithmic distribution 
of the quotient p.l7| ) is Xg.a,biv/ V^4iW^ogq). A theorem of Levy from 1925 im Section 4.2, 



Theorem 4], the Continuity Theorem for characteristic functions, asserts that all we need to show 
is that 

lim Xq,aJ ^=2==) = e~^"/' (3.18) 



g^oo '^'"''^\^2(j){q) \ogq 



for every fixed real number rj. Because the right-hand side is continuous at 77 = 0, it is automat- 
ically the characteristic function of the measure to which the limiting logarithmic distributions of 



the quotients (jXTTj) converge in distribution, according to Levy's theorem. 
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When q is large enough in terms of rj, we have \rj/ y^2(j){q) log q\ < |. For such q. Proposi- 
tion 



2.13 implies that 



r] \ V{q]a,h) if / {c{q,a) - c{q,b))\r]\ V{q;a,b)r]'^ 



v/20(g) logg/ 20(g) log g 2 V ^0(g) log g ((/)(g)logg) 



I Q/' ^^^^S^QS^ I l^l^(^) I (3 19) 

2 V logg ^0(g) logg 0(g) log g/ 

using the asymptotic formula for V (g; a, b) given in Proposition |3]6} Since i] is fixed, this is enough 



to verify p.l8| ), which establishes the theorem. □ 



3.6. Racing quadratic nonresidues against quadratic residues. This section is devoted to un- 
derstanding the effect of low-lying zeros of Dirichlet L-functions on prime number races between 
quadratic residues and quadratic nonresidues. This phenomenon has already been studied by many 
authors — see for instance [3|. Let g be an odd prime, and define n(x] g, A^) = #{j9 < x: p is 
a quadratic nonresidue (mod g)} and 7r(x; g, R) = < a;: p is a quadratic residue (mod g)}. 
Each of 7r(x; g, A^) and 7i{x; g, R) is asymptotic to 7r(x)/2, but Chebyshev's bias predicts that the 
difference 7r(x; g, A^) — 7i{x; g, R), or equivalently the normalized difference 

E{x- N,R) = ^-^ {7i{x- q, N) - 7r(x; g, R)) , 
Jx 



is more often positive than negative. 

Our methods lead to an asymptotic formula for 5(g; A^, i?), the logarithmic density of the set of 
real numbers x>\ satisfying tx{x\ g, A^) > 7r(a;; g, i?), that explains the effect of low-lying zeros 
in a straightfoward and quantitative way. We sketch this application now. 

First, define the random variable 

Xq-N,R = 2 + 2 



7>0 \I-A+T 
L(l/2+i7,xi)=0 



where xi is the unique quadratic character (mod g). Under GRH and LI, the distribution of Xg-^^ji 
is the same as the limiting distribution of the normalized error term E{x; N, R). The methods of 
Section |3] then lead to an asymptotic formula analogous to equation ( |1.2| ): 




where 



V{q;N,R) = b{xi)= TT 



+ 7' 



4 

L(l/2+i7,xi)=0 

To simplify the discussion, we explore only the effect of the lowest zero (the zero closest to the 
real axis) on the size of V{q; N, R). 

By the classical formula for the zero-counting function N{T, x), the average height of the lowest 
zero of L(s, Xi) is 2tt/ log g. Suppose we have a lower-than-average zero, say at height c-27r/ log g 
for some < c < 1. Then we get a higher-than- average contribution to the variance of size 

1 1 



1/4 + (c- 27r/logg)2 1/4 + (27r/ log g)^ 
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Since the variance V{q; N, R) = b{xi) is asymptotically logg by Lemma 3.5 this increases the 
variance by roughly a percentage t given by 

^. W 1 1 A .3 21) 

logg V1/4+ (c- 27r/logg)2 1/4 + (27r/ logg)2y ' 

Therefore, given any two of the three parameters 

• how low the lowest zero is (in terms of the percentage c of the average), 

• how large a contribution we see to the variance (in terms of the percentage t), and 

• the size of the modulus q, 

we can determine the range for the third parameter from equation p.21| ). 



For example, as c tends to 0, the right-hand side of equation p.21| ) is asymptotically 

QAtt^ 



(log q + 167r2) logg 

So if we want to see an increase in variance of 10%, an approximation for the range of q for 
which this might be possible is given by setting 647?^/ (log^ q + IGtt^) logg = 0.1 and solving for 
g, which gives logg = 15.66 or about g = 6,300,000. This assumes that c tends to — in other 
words, that L(s, Xi) has an extremely low zero. However, even taking c = | on the right-hand side 
of equation p.21| ) and setting the resulting expression equal to 0.1 yields about g = 1,600,000. In 



other words, having a zero that's only a third as high as the average zero, for example, will give a 
"noticeable" (at least 10%) lift to the variance up to roughly g = 1,600,000. 

It turns out that unusually low zeros of this sort are not particularly rare. The Katz-Samak model 
predicts that the proportion of L-functions in the family {L(s, x) '■ X primitive of order 2} having a 
zero as low as c ■ 27r/log g is asymptotically 2it'^c^ /9 as c tends to 0. Continuing with our example 
value c = |, we see that roughly 8% of the moduli less than 1,600,000 will have a 10% lift in the 
variance V{q; N, R) coming from the lowest-lying zero. 

Well-known examples of L-functions having low-lying zeros are the L(s, xi) corresponding to 
prime moduli g for which the class number h(—q) equals 1, as explained in ^ with the Chowla- 
Selberg formula for g = 163. For this modulus, the imaginary part of the lowest- lying zero is 
0.202901 . . . = 0.16449 . . . ■ 27r/ log 163. According to our approximations, this low-lying zero 
increases the variance by roughly t = 56%; considering this increased variance in equation p.20| ) 
explains why the value of 5(163; A^, R) is exceptionally low. The actual value of (5(163; N, R), 
along with some neighboring values, are shown in Table [2j 

Other Dirichlet L-functions having low-lying zeros are the L(s, xi) corresponding to prime 
moduli g for which the class number h{—q) is relatively small; a good summary of the first few 
class numbers is given in [3 , Table VI]. 



Table 2. Values of 5(g; A^, R) for g = 163 and nearby primes 



q 


6{q-N,R) 


151 


0.745487 


157 


0.750767 


163 


0.590585 


167 


0.780096 


173 


0.659642 
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Notice that in principle, racing quadratic residues against quadratic nonresidues makes sense for 
any modulus q for which p{q) = 2, which includes powers of odd primes and twice these powers. 
However, being a quadratic residue modulo a prime q is exactly equivalent to being a quadratic 
residue modulo any power of q, and also (for odd numbers) exactly equivalent to being a quadratic 
residue modulo twice a power of q. Therefore 5{q; N, R) = 5{q^] N, R) = 5{2q^] N, R) for every 
odd prime q. The only other modulus for which p{q) = 2 is g = 4, which has been previously 
studied: Rubinstein and Sarnak UJJ calculated that 5(4; A^, R) = 5(4; 3, 1) ^ 0.9959. 



4. Fine-scale differences among races to the same modulus 



In this section we probe the effect that the specific choice of residue classes a and b has on the 



density S{q; a, b). We begin by proving Corollary 1.9 which isolates the quantitative influence of 
5{q; a, b) on a and b from its dependence on g, in Section 4.1 We then dissect the relevant influ- 
ence, namely the function A(g; a, 6), showing how particular arithmetic properties of the residue 
classes a and b predictably affect the density; three tables of computational data are included to il- 



lustrate these conclusions. In Section 4.2 we develop this theme even further, proving Theorem 4.2 



and hence its implication Theorem 1.10 which establishes a lasting "meta-bias" among these den- 
sities. Finally, in Section |43 we apply our techniques to the seemingly unrelated "mirror image 
phenomenon" observed by Bays and Hudson, explaining its existence with a similar analysis. 



4. 1 . The impact of the residue classes a and b. The work of the previous sections has provided 



us with all the tools we need to establish Corollary 1 .9 



Proof of Corollary \1.9\ We begin by showing that the function 

A(g; a, b) = Kg{a - b) + Lg{-ab-^) log 2 + + + H{q; a, b) 



r2 



defined in equation ( |1.4[ ) is bounded above by an absolute constant (the fact that it is nonnegative 
is immediate from the definitions of its constituent parts). It has already been remarked in Defini- 
tion 1 .5 that Kg is uniformly bounded, as is l.^. We also have A(r)/r < (log r) / r, and this function 



is decreasing for r > 3, so the third and fourth terms are each uniformly bounded as well. Finally, 

1 logp 



from Definition 1 1. 8 i we see that 

h{q;p,r 



1 logp 

0(p^) pe(<?;p,r) - p-l pi ' 



< 



and so H{q; a, b) < 2(logp) /p{p — 1) is uniformly bounded by a convergent sum as well. 
We now turn to the main assertion of the corollary. By Theorems 1 1 .4| and 1 1 .7 [ we have 

V{q- a,b) = 20(g) (£(g) + K^ia - b) + igi-ab'') log 2) + 2M*{q; a, b) 



2(f){q) C{q) + Kg{a - b) + tq{~ab-^) log 2 + + 



A(ri) A(r2) 



n 



r2 



+ H{q;a,b) + 



log^g 



2(/)(g) £(g) + A(g;a,6) + 



log q 



20(.)/:(.)fi + ^^ + o 



\ogq 
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Table 3. The densities S{q; a, 1) computed for q = 163 



q 


a 




S{q;a, 1) 


q 


a 


a-i 


S{q;a, 1) 


163 


162 


162 


0.524032 


163 


30 


125 


0.526809 


163 


3 


109 


0.525168 


163 


76 


148 


0.526815 


163 


2 


82 


0.525370 


163 


92 


101 


0.526829 


163 


5 


98 


0.525428 


163 


86 


127 


0.526869 


163 


7 


70 


0.525664 


163 


128 


149 


0.526879 


163 


11 


89 


0.525744 


163 


129 


139 


0.526879 


163 


13 


138 


0.526079 


163 


80 


108 


0.526894 


163 


17 


48 


0.526083 


163 


114 


153 


0.526898 


163 


19 


103 


0.526090 


163 


117 


124 


0.526900 


163 


23 


78 


0.526213 


163 


20 


106 


0.526906 


163 


31 


142 


0.526378 


163 


42 


66 


0.526912 


163 


67 


73 


0.526437 


163 


28 


99 


0.526914 


163 


37 


141 


0.526510 


163 


44 


63 


0.526925 


163 


29 


45 


0.526532 


163 


12 


68 


0.526931 


163 


27 


157 


0.526578 


163 


72 


120 


0.526941 


163 


32 


107 


0.526586 


163 


112 


147 


0.526975 


163 


59 


105 


0.526620 


163 


110 


123 


0.526981 


163 


8 


102 


0.526638 


163 


122 


159 


0.526996 


163 


79 


130 


0.526682 


163 


50 


75 


0.526997 


163 


94 


137 


0.526746 


163 


52 


116 


0.527002 


163 


18 


154 


0.526768 











Since A(g; a, h) is bounded while C{q) ~ logg, we see that V{q) ~ 20(g) logg; moreover, the 
power series expansion of (1 + t)^^/^ around t = implies that 

(Recall that we are assuming that g > 43, which is enough to ensure that £(g) is positive.) Together 
with the last assertion of Theorem [TTT| this formula implies that 



d(g;a, 6) = - + 



V 2C{q) ^"Vlog'J/' I' 



3 



2 2v/vr0(g)£(g) V 2£(g) Khg'qJJ \V{q-a,hfl^ J' 

Since the last error term is g^/ (0(g) log qY^"^, it can be subsumed into the first error term, and 
the proof of the corollary is complete. □ 



Corollary 1 .9 tells us that larger values of A(g; a, h) lead to smaller values of the density (5(g; a, h). 
Computations of the values of (5(g; a, h) (using methods described in Section 5.4) illustrate this re- 
lationship nicely. Since S{q;a,b) = S{q; ab~^ ,1) when 6 is a square (mod g), we restrict our 
attention to densities of the form 5{q; a, 1). 

We begin by investigating a prime modulus g, noting that 

/ N, A(a) A(a"M 2 logg 

A(g;a,l) = .,(-a)log2+^+ ^ ^ ■ 



a a ^ q{q 
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when q is prime (here denotes the smallest positive integer that is a multiplicative inverse of 
a (mod q)). Therefore we obtain the largest value of A(g; a, h) when a = —1 (mod q), and the next 
largest values are when a is a small prime, so that the A (a) /a term is large. (These next large values 
also occur when a^^ is a small prime, and in fact we already know that S{q; a, 1) = S{q; a~\ 1). 
When q is large, it is impossible for both a and to be small.) Notice that A(a)/a is generally 
decreasing on primes a, except that A(3)/3 > A(2)/2. Therefore the second, third, and fourth- 
largest values of A(g; a, 1) will occur for a congruent to 3, 2, and 5 (mod q), respectively. 

This effect is quite visible in the calculated data. We use the prime modulus q = 163 as an 
example, since the smallest 12 primes, as well as —1, are all nonsquares (mod 163). Table|3]lists 
the values of all densities of the form 5(163, a, 1) (remembering that 5{q; a, 1) = a~^, 1) and 
that the value of any a, b) is equal to one of these). Even though the relationship between 



A(g; a, 1) and 5{q; a, 1) given in Corollary lL9 involves an error term, the data is striking. The 



smallest ten values of 6{q; a, 1) are exactly in the order predicted by our analysis of A(g; a, 1): the 
smallest is a = 162 = —1 (mod 163), then a = 3 and a = 2, then the seven next smallest primes 
in order. (This ordering, which is clearly related to Theorem 1.10[ will be seen again in Figure [2j) 



One can also probe more closely the effect of the term M{q; a, 1) upon the density 6{q; a, 1) 
Equation ( |3.10[ ) can be rewritten as the approximation 

M{q;aA) ^ ^ y y Mnl^ y A(n) ^^^^ 

n<y n<y 'n.<y 

n=l (mod q) n=a (mod q) n=a~^ (mod q) 

(where we are ignoring the exact form of the error term). Taking y = q recovers the approximation 
M(g; a, 1) ~ 0(g) (A(a)/a + A(a^^)/a^^) used in the definition of A(g; a, 6), but taking y larger 
would result in a better approximation. 

We examine this effect on the calculated densities for the medium-sized prime modulus q = 101. 
In Table |4} the second group of columns records the first four prime powers that are congruent to 
a or a^^ (mod 101). The second-to-last column gives the value of the right-hand side of equa- 



tion ( |4.1[ ), computed at y = 10 . Note that smaller prime powers in the second group of columns 
give large contributions to this second-to-last column, a trend that can be visually confirmed. Fi- 
nally, the last column lists the values of the densities 5{q] a, h), according to which the rows have 
been sorted in ascending order. The correlation between larger values of the second-to-last column 
and smaller values of a, h) is almost perfect (the adjacent entries a = 40 and a = 10 being 
the only exception): the existence of smaller primes and prime powers in the residue classes a and 
(mod 101) really does contribute positively to the variance V{q] a, 1) and hence decreases the 
density a, 1). (Note that the effect of the term iioi(— ct) log 2 is not present here, since 101 is a 
prime congruent to 1 (mod 4) and hence —1 is not a nonsquare.) 

Finally we investigate a highly composite modulus q to witness the effect of the term Kq{a—1) = 
A(g/(g,a — l))/(f){q/{q, a — 1)) — A(g)/0(g) on the size of A(g;a, 1). This expression vanishes 
unless a — 1 has such a large factor in common with q that the quotient g/(g, a — 1) is a prime 
power. Therefore we see a larger value of A(g; a, 1), and hence expect to see a smaller value of 
S{q; a, 1), when a — 1) is a small prime, for example when a = | + 1. 

Table [5] confirms this observation with the modulus q = 420. Of the six smallest densities 
5(420; a, 1), five of them correspond to the residue classes a (and their inverses) for which q/{q,a — 
1) is a prime power; the sixth corresponds to a = — 1 (mod 420), echoing the effect already seen 
for q = 163. Moreover, the ordering of these first six densities are exactly as predicted: even the 
battle for smallest density between a = — 1 (mod 420) and a = 420 /2 — 1 is appropriate, since both 
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Table 4. The effect of medium- sized prime powers on the densities S{q; a, 1), 
illustrated with q = 101 and y = 10® 



q 


a 




First four 


prime powers 


RHS of (|4T1) 


5(101, a, 1) 


101 


7 


29 


7 






SI 9 


0.563304 


0.534839 


101 


2 


51 


7 




7nq 

/ \jy 


8Sq 

ojy 


0.554043 


0.534928 


101 


3 


34 


J 


J J 1 


81 1 
oil 


1 01 ^ 


0.528385 


0.535103 


101 


11 


46 


1 1 
i i 




fi1 7 


1 071 


0.383090 


0.536123 


101 


8 


38 



o 


1 0Q 


1 


^1 1 


0.332888 


0.536499 


101 


53 


61 


J J 


U i 




4S7 


0.329038 


0.536522 


101 


12 


59 


SQ 

Jy 






71 Q 

J ly 


0.276048 


0.536955 


101 


67 


98 


O / 


1QQ 

lyy 


z,\jy 


401 


0.271567 


0.536993 


101 


41 


69 


4.1 


94.^ 


971 


647 


0.268766 


0.537013 


101 


28 


83 


8^ 

OJ 


J J i 


487 


q^7 


0.235130 


0.537284 


101 


15 


27 


97 


198 


79q 


41 q 


0.235035 


0.537293 


101 


66 


75 


1 f<l 
iu / 


977 


^ J y 


S71 


0.230291 


0.537340 


101 


18 


73 


/ J 


S9^ 


881 
oo i 


1 1 ?q 


0.215281 


0.537463 


101 


50 


99 








7S7 


0.211209 


0.537500 


101 


55 


90 


191 


257 


661 


797 


0.205833 


0.537537 


101 


42 


89 


89 


547 


1153 


1301 


0.202289 


0.537586 


101 


44 


62 


163 


347 


751 


769 


0.199652 


0.537607 


101 


72 


94 


173 


397 


577 


599 


0.196417 


0.537623 


101 


32 


60 


32 


739 


941 


1171 


0.191447 


0.537660 


101 


26 


35 


127 


439 


641 


733 


0.190601 


0.537688 


101 


39 


57 


241 


443 


461 


1049 


0.187848 


0.537708 


101 


40 


48 


149 


343 


1151 


1361 


0.178698 


0.537780 


101 


10 


91 


293 


313 


919 


1303 


0.180422 


0.537792 


101 


74 


86 


389 


983 


1399 


1601 


0.165153 


0.537900 


101 


63 


93 


467 


1103 


1709 


2083 


0.146466 


0.538067 



residue classes cause an increase in A(420; a, 1) of size exactly log 2. (Since 420 is divisible by the 
four smallest primes, the largest effect that the A(a) / a term could have on A(g; a, h) is (log 1 1)/1 1, 
and so these effects are not nearly as large.) The magnitude of this effect is quite significant: note 
that the difference between the first and seventh- smallest values of 5(420; a, 1) (from a = 211 to 
a = 17) is larger than the spread of the largest 46 values (from a = 17 to a = 391). 

4.2. The predictability of the relative sizes of densities. The specificity of our asymptotic for- 
mulas to this point suggests comparing, for fixed integers ai and 02, the densities 5{q] ai, 1) and 
S{q; a2, 1) as q runs through all moduli for which both ai and 02 are nonsquares. (We have already 



seen that every density is equal to one of the form 6{q; a, 1).) Theorem 1.10 which we will derive 



shortly from Corollary 4.3[ is a statement about exactly this sort of comparison. 



In fact we can investigate even more general families of race games: fix rwo rational numbers 
r and s, and consider the family of densities S{q; r + sq, 1) as q varies. We need r + sg to be an 
integer and relatively prime to q for this density to be sensible; we further desire r + sg to be a 
nonsquare (mod q), or else S{q; r + sq, 1) simply equals ^. Therefore, we define the set of qualified 
moduli 

Q{r, s) = {q e N: r + sq e Z, {r + sq, q) = 1; there are no solutions to x'^ = r + sq (mod q)}. 

(Note that translating s by an integer does not change the residue class of r + sg (mod q), so one 
could restrict s to the interval [0, 1) without losing generality if desired.) 
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Table 5. The densities S{q; a, 1) computed for q = 420, together with the values 

of Kg{a - 1) = A(420/(420, a - 1))/0(42O/(42O, a - 1)) 



Q 


a 


a-^ 


fo, a — 1) 


KJa~ 1) 


(Jfa; a, 1) 




a 


a-^ 


((7, a — 1) 


K„{a- 1) 


Jfo: a, 1) 


420 


211 


211 


210 


log 2 


0.770742 


420 


113 


197 


28 





0.807031 


420 


419 


419 


2 





0.772085 


420 


149 


389 


4 





0.807209 


420 


281 


281 


140 


flog 3) /2 


0.779470 


420 


103 


367 


6 





0.807284 


420 


253 


337 


84 


flog 5) /4 


0.788271 


420 


223 


307 


6 





0.807302 


420 


61 


241 


60 


flog 7)/6 


0.788920 


420 


83 


167 


2 





0.807505 


420 


181 


181 


60 


flog 7)/6 


0.789192 


420 


151 


331 


30 





0.809031 


420 


17 


173 


4 





0.795603 


420 


59 


299 


2 





0.809639 


420 


47 


143 


2 





0.796173 


420 


137 


233 


4 





0.809647 


420 


29 


29 


28 





0.796943 


420 


139 


139 


6 





0.810290 


420 


13 


97 


12 





0.797669 


420 


73 


397 


12 





0.811004 


420 


187 


283 


6 





0.797855 


420 


157 


313 


12 





0.811197 


420 


53 


317 


4 





0.798207 


420 


251 


251 


10 





0.811557 


420 


11 


191 


10 





0.798316 


420 


349 


349 


12 





0.811706 


420 


107 


263 


2 





0.798691 


420 


323 


407 


14 





0.811752 


420 


41 


41 


20 





0.800067 


420 


179 


359 


2 





0.811765 


420 


19 


199 


6 





0.800937 


420 


229 


409 


12 





0.811776 


420 


43 


127 


42 





0.801609 


420 


131 


311 


10 





0.811913 


420 


23 


347 


2 





0.802681 


420 


277 


373 


12 





0.812052 


420 


37 


193 


12 





0.803757 


420 


239 


239 


14 





0.812215 


420 


79 


319 


6 





0.804798 


420 


247 


403 


6 





0.812215 


420 


89 


269 


4 





0.804836 


420 


227 


383 


2 





0.812777 


420 


101 


341 


20 





0.805089 


420 


221 


401 


20 





0.813594 


420 
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It turns out that every pair (r, s) of rational numbers can be assigned a "rating" i?(r, s) that 
dictates how the densities in the family r + sg, 1) compare to other densities in similar families. 

Definition 4.1. Define a rating function i?(r, s) as follows: 

• Suppose that the denominator of s is a prime power p'^ (k > 1). 

- If r is a power of the same prime, then R(r, s) = (logp) /(f){p^~^''). 

- If r = 1 or r = l/p' for some integer I < j < k, then R{r, s) = {log p) / (j){p^). 

- If r = 1/p^, then _R(r, s) = (logp)/p^. 

- Otherwise R{r, s) = 0. 

• Suppose that s is an integer. 

- If r = — 1, then R{r,s) = log 2. 

- If r is a prime power {j > 1), then i?(r, s) = {logp)/p' . 

- Otherwise R{r, s) = 0. 

• -R(r, s) = for all other values of s. 

Theorem 4.2. Let A(g; a, b) be defined as in equation ( 1.4| ). For fixed rational numbers r and s, 

A{q;r + sq,l) = Rir,s)+Or ' 
as q tends to infinity within the set Q{r, s). 
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We will be able to prove this theorem at the end of the section; first, however, we note an 
interesting corollary. 

Corollary 4.3. Assume GRH and LI. Ifri, si, r2, S2 are rational numbers such that R{ri, si) > 
R{j2, S2), then 

6{q] ri + siq, 1) < S{q] r2 + S2q, l)for all but finitely many q G Q{ri, Si) n Q{r2, S2). 



Proof. We may assume that q > 43. Inserting the conclusion of Theorem |4.2| into the formula for 

(4.2) 



5{q;a,b) in Corollary 1.9. we obtain 

1 



6{q;r + sq,l) 



2 2^n<j>{q)C{q) 
for any q E Q{r,s). Therefore for all q E Q{ri,si) r\Q{r 2^82), 

-R{ri,si) + R{r2,S2 



6{q; n + siq, 1) - 6{q; r2 + S2q, 1) 



2£(g) 



+ 



p{(l) 



\og^ qj J 2^71 <P{q)C{q)' 



Since the constant — -R(ri, si) + R{r2, S2) is negative by hypothesis, we see that ri + siq, 1) — 
5{q] ^2 + S2q, 1) is negative when q is sufficiently large in terms of ri, si, r2, and S2. □ 

Notice, from the part of Definition 4.1 where s is an integer, that Theorem 1.10| is precisely the 
special case of Corollary |4.3| where si = S2 



0. Therefore we have reduced Theorem 1.10 



to 



proving Theorem 4.2 



Theorem |1.10 itself is illustrated in Figure [2} using the computed densities for prime moduli 
to most clearly observe the relevant phenomenon. For each prime q up to 1000, and for every 
nonsquare a (mod q), the point 

1. VA^) ( log ^) «^ 1) - D - log ^) (4.3) 



Figure 2. Normalized densities 5{q] a, 1) for primes q, using the normalization (4.3 1 
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has been plotted; the values corresponding to certain residue classes have been emphasized with the 
listed symbols. The motivation for the seemingly strange (though order-preserving) normalization 



in the second coordinate is equation (4.2), which shows that the value in the second coordinate is 



—R{a, 0)/2 + 0(1/ log q). In other words, on the vertical axis the value corresponds to 6{q; a, b) 
being exactly the "default" value ^+p{q)/2^yTT(j){q)C{q), the value —0.05 corresponds to 6{q; a, b) 
being less than the default value by 0. 05 p{q)/ 2 ^y7^(j){q)C{q)^, and so on. We clearly see in Fig- 
ure |2] the normalized values corresponding to 5{q; —1, 1), 6{q; 3, 1), 6{q; 2, 1), and so on sorting 
themselves out into rows converging on the values — | log 2, — | log 3, — | log 2, and so on. 



We need to establish several lemmas before we can prove Theorem 4.2 The recurring theme in 
the following analysis is that solutions to linear congruences (mod q) with fixed coefficients must 
be at least a constant times q in size, save for specific exceptions that can be catalogued. 

Lemma 4.4. Let r and s be rational numbers. If r ^ —1 or if s is not an integer, then there are 
only finitely many positive integers q such that r + sq is an integer and r + sq = —1 (mod q). 

Proof. Write r = f and s = ^. The congruence | + = — 1 (mod q) implies that ^ + cq = 
—d (mod q), which means that q must divide ^ + d. This only happens for finitely many q unless 
^ + d = 0, which is equivalent (since 7^ 0) to | = — 1. In this case the congruence is 
^q = (mod q), which can happen only if ^ is an integer. □ 

Lemma 4.5. Let r and s be rational numbers. Suppose that q is a positive integer such that r + sq 
is an integer Ifr ^ 1, then Kq{r + sq — 1) <t^r,s (logg)/?. 

Proof. We first note that A(t)/(l){t) <^ {\ogt)/t for all positive integers t: if A(t) is nonzero, then 
t is a prime power, which means > t/2. Therefore it suffices to show that (g, r + sg — 1) is 
bounded, since then q/{q,r + sq — 1) 3>r,s q and consequently Kq{r + sq — 1) <^r,s 0-Ogq)/q 
since (log t)/t is decreasing for t > 3. But writing r = | and s = ^, we have 

{Qi f + - 1) I (9> d{a - 6) + bcq) = (g, d{a - b)) \ d{a - b). 

Since r 7^ 1, we see that d{a — b) is nonzero, and hence (g, r + sg — 1) < d\a — b\ <tir,s 1 as 
required. □ 

Lemma 4.6. Let r and s be rational numbers. Assume that r is not a positive integer or s is not 
an integer If q and y are positive integers such that r + sq is an integer and y = r + sq (mod g), 
then y ^^.s Q- 

Proof. Suppose first that s is not an integer, and write s = c/d where d > 1. Then s is at least 1/d 
away from the nearest integer, so that sg is at least q/d away from the nearest multiple of g. Since 
y = r + sq — mq for some integer m, we have y > \sq — mq\ — |r| > q/d ~ \r\ '^r,s Q when g is 
sufficiently large in terms of r and s. 

On the other hand, if s is an integer, then r must also be an integer. If r is nonpositive, then the 
least integer y congruent to r (mod g) is g — |r| Q when g is sufficiently large in terms of r. □ 

Lemma 4.7. Let r and s be rational numbers. Assume that either r is not the reciprocal of a 
positive integer or that ^ is not an integer Suppose that positive integers q and y are given such 
that r + sq is an integer and (r + sq)y = 1 (mod g). Then y ^,. 3 g. 

Proof. Write r = | and s = ^ with (a, b) = (c, d) = 1 and b,d>0. We may assume that g > 2d'^, 
for if g < 2d'^ then y > I > 1^ q. Note that a 7^ 0, since + ^g = cannot be invertible 
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modulo q when q > d. The assumption that r + sg is an integer implies that d{r + sq) = ^ + cq 
is also an integer; since (a, b) = 1, this implies that b \ d. Therefore we may write d = bS for some 
integer 6. Similarly, it must be true that b{r + sq) = a + y is an integer; since (c, S) \ (c, d) = 1, 
this implies that g is a multiple of 6. 

Case 1: Suppose first that S = 1. If a = 1, then r = would be the reciprocal of a positive 
integer and f. = y/E would be an integer, contrary to assumption; therefore a 7^ 1. The condition 
(r + sq)y = 1 (mod q), when multiplied by b, becomes ay = b (mod q). Now if a = — 1, then 
the congruence in question is equivalent to y = —b (mod q); since b > 0, this implies that y > 
q — b q as desired. Therefore for the rest of Case 1, we can assume that \a\ > 1. 

Since any common factor of a and q would consequently be a factor of b as well, but (a, 6) = 1, 
we must have (a, g) = 1. Thus we may choose u such that uq = —1 (mod a), so that yo = 
b{uq + l)/a is an integer. We see by direct calculation that yo is a solution to ay = b (mod q), and 
all other solutions differ from this one by a multiple of q/{b, q), which is certainly a multiple of |. 

In other words, y = q{^ + f ) + a ^'^^ some integer 2. If ^ + | = then — 2 = + |) — 2; = ^ 
would be an integer, but this is impossible since both b and u are relatively prime to a (here we 
use \a\ ^ 1). Therefore I — + f I > rrr, and so y > tti — A; since q > 2d'^ = 26^, this gives 

Case 2: Suppose now that 5 > 1. The condition (| + ^Q)y = 1 (mod q) forces {y, q) = I and so 
(y, 5) = 1 as well. Multiplying the condition by b yields ay + cy^ = b (mod q), which we write as 
^ — qm = b — ay for some integer m. But notice that {cy, 5) = 1, so that y is at least | away 
from every integer (here we use 5 > 1); therefore ^ is at least | away from the nearest multiple 
of q. Therefore | < |^ — qm\ = \b — ay\ < b + \a\y, and hence y > {q — b6)/\a\6; since 
q > 2d = 2b5, this gives y > :^r,s q- □ 

Corollary 4.8. Let r and s be rational numbers, and let q be a positive integer such that r + sq is 
an integer 

(a) Assume that r is not a positive integer or s is not an integer Suppose that y is a positive 
integer such that y = r + sq (mod q). Then A{y)/y <^r,s (log q) / q. 

(b) Assume that either r is not the reciprocal of a positive integer or that ^ is not an integer. 
Suppose that y is a positive integer such that (r + sq)y = 1 (mod q). Then A{y)/y <tir,s 
{log q)/q. 

Proof. Since A{y)/y < {\ogy)/y, which is a decreasing function for y > 3, this follows from 
Lemmas IM] and i^Tl □ 

Lemma 4.9. Let r and s be rational numbers. Let q be a positive integer such that r + sq is an 
integer, and let p be a prime such that p'^ \\ q with u >\. 

(a) Suppose that e is a positive integer such that p'^ = r + sq (mod q/p"). Then either p^ = r 
orp"" >r,s q/p"- 

(b) Suppose that e is a positive integer such that p^{r + sq) = 1 (mod qjp^). Then either 
p^ = 1/r orp*^ ^r,s q/p"- 

Notice that if p*^ = r in (a) then sp'' is an integer; also, if p*^ = 1/r in (b) then sp'^^" is an integer. 
In both cases, it is necessary that the denominator of s be a power of p as well. 
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Proof. We may assume that q/p" h sufficiently large in terms of r and s, for otherwise any positive 
integer is ^^^^ q/p" ■ We have two cases to examine. 

(a) We are assuming that p^ = r + sq (mod q/p"). Suppose first that sp" is an integer. Then sq 
is an integer multiple of q/p^ , and p^ = r (mod q/p"). This means that either = r or 

> q/p" + r q/p^ , since q/p" is sufficiently large in terms of r. 
On the other hand, suppose that sp" is not an integer. Then 

p'^ = r + sq = r + {sp^)q/p^ = r + {sp^ — [sp^^J )q/p^ (mod q/p^). 

If the denominator of s is d, then the difference sp^ — [sp'^J is at least ^, and therefore 

> g/c/p*^ + r ^r,s g/p*^ as well, since q/p" is sufficiently large in terms of r and s. 



(b) We are assuming that p^{r + sq) = 1 (mod q/p'^). We apply Lemma 4.7 with q/p^ in 
place of g and with y = p'^, which yields the desired lower bound p^ ^r,s q/p" unless 
r is the reciprocal of a positive integer and ^ is an integer. In this case, multiplying the 
assumed congruence by the integer 1/r gives p^(l + ^g) = l/r (mod q/p^), which implies 
= 1/r (mod q/p") since ^ is an integer. Therefore, since q/p" is sufficiently large in 
terms of r, either p"^ = 1/r or > q/p" + 1/r > q/p" . 

□ 

The next two lemmas involve the functions h{q] p, r) and H(q] a, h) that were defined in Defini- 
tion 1.8 Since we are dealing with rational numbers, we make the following clarification: when 
we say "power of p", we mean p'^ for some positive integer k (so p^ and p^ are powers of p, but 
neither 1 norp~^ is). 

Lemma 4.10. Let r and s be rational numbers, and suppose that q is a positive integer such that 
r + sq is an integer that is relatively prime to q. Let p be a prime dividing q, and choose v > 1 
such that p^ II q. 

(a) If both r and the denominator of s are powers ofp (note that if the denominator ofs equals 
p^, these conditions imply u = k), then 



%;p,(r + .gn = -^ + 0.,(^): 



otherwise h{q;p, (r + sq)~^) <^r,s (}og p)/q. 
(b) If both 1/r and the denominator ofs are powers ofp (note that ifr = l/p' and the denom- 
inator ofs equals p^, these conditions imply u = k ~ j), then 

r\ogp flogp 
hiq;p,r + sq) = j^ + Or 

otherwise h{q;p,r + sq) -Cr^^ (}ogp)/q. 



Proof. (a) Assume p'^ = r + sq (mod q/p"). By Lemma 4.9[ we have that either r = p'^ (which 



implies that the denominator of s is apowerofp), or else h{q;p, (r + sq)^^) ^^.s (}ogp)/q. 
So we only need to compute p, (r + sq)"^) in the case where r is any power of p (say 
r = p'^) and where s has a denominator which is a power of p (say s = c/p^, where z < u 
since g is a multiple of the denominator of s). 

In this case the congruence p'^ = r + sq (mod q/p^) is satisfied. Furthermore, e is the 
minimal such positive integer if q is sufficiently large in terms of r and s. If e is minimal we 
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have h{q;p, (r + sq)~^) = {\ogp) / (t){p'')p^ = (Iogp)/r0(j9'^) by definition; if e is not min- 
imal we have h{q; p, (r + sg)^^) 0-Ogp)/q since there are only finitely many possible 
values of q. In both cases, the proposition is established (the "main term" (logp)/r(f){p'^) is 
actually dominated by the error term in the latter case). 



(b) Assume p'^{r + sq) = 1 (mod q/p'^). By Lemma 4.9 we have that either 1/r = p'^ (which 
implies that the denominator of s is a power of p), or else h{q;p,r + sq) <tir,s 0-Ogp)/q. 
So we only need to compute h{q; p,r + sq) in the case where 1/r is any power of p (say 
1/r = p') and where s has a denominator which is a power of p (say s = c/p'^, where 
- j = z/ > 0). 

In this case the congruence ^^(r+sg) = 1 (mod q/p'^)h satisfied (since sg = (mod q/p^)). 
We can rewrite this congruence as = 1/r (mod qjp^). As above, either j is the minimal 
such positive integer, in which case h(q]p,r + sq) = {\ogp) / (l){p^)p^ = (rlogp) /(f){p'^) 
by definition, or else q is bounded in terms of r and s, in which case h{q;p, r + sq) <^r,s 
(logp)/g. In both cases, the proposition is established. 

□ 

Corollary 4.11. Let r and s be rational numbers, and suppose that q is a positive integer such that 
r + sq is an integer that is relatively prime to q. 

(a) Suppose both r and the denominator of s are powers of the same prime p. Then 



(pijji+f^) ' y q 

where r = p^ and the denominator ofs is p^. 

(b) Suppose both 1/r and the denominator of s are powers of the same prime p, with 1/r < s. 
Then 

where the denominator ofs is p^. 

(c) If neither of the above sets of conditions holds, then H{q; r + sq, 1) <^r,s (log q) / q. 



Proof. We sum the conclusion of Lemma 4.10 over all prime divisors p of q (and, according to 



Definition 1.8 over both residue classes r + sq and (r + sq)~^ for each prime divisor). For each 
such p there is a contribution of 0^,^ ((logp)/g) from error terms, and the sum of all these terms is 
^r,s ^ 'I2p\q logp < (log q)/q. The only remaining task is to consider the possible main terms. 
If ^ — pj ^Yyq denominator of s are powers of the sa me pr ime p, then this prime p must 



divide any q for which r + sg is an integer; hence by Lemma |4.10[ we have p'^ \\ q and the term 
h{q;p, (r + sq)^^) contributes (Iogp)/r0(j9'^) = (logp)/0(p-^+^)~to H{q;r + sq, 1). Similarly, if 
r = 1 /p^ and the denominator p'^ of s are powers of the same prime p with j < k, then this prime p 



must divide any g for which r+sg is an integer (this would be false if j = k); hence by Lemma 4. 10 
we have p^^^ \\ q and so the term h{q;p,r + sq) contributes r {log p)/(j){p'^) = {log p)/(f){p'') to 
H(q; r + sg, 1). For other pairs (r, s), no main term appears, and so the corollary is established. □ 



Proof of Theorem [472] From the definition ( |1.4[ ) of A(g; a, b), we have 



A g; r + sq, 1) = U-{r + sq)) log 2 + Kg{r + sg - 1 + + + H{q; r + sq, 1), 

ri r2 
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where ri and are the least positive integers congruent to r + sg and (r + sq) ^, respectively, 
modulo q. The results in this section allow us to analyze each term individually: 

• If r = — 1 and s is an integer, then Lq{—{r + sq)) log 2 = log 2. Otherwise, iq{—{r + 
sq)) log 2 = for all but finitely many (depending on r and s) integers q by Lemma 4.4 
whence in particular Lq{—{r + sq)) log 2 <Cr-,s (logg)/g. 

• If r = 1, then [r + sq — 1, q) = {sq, q) = q/d where d is the denominator of s, and so 
Kq{r + sq — 1) = k{d)/(j){d) by Definition 1.5, Otherwise, Kq{r + sq—l) <^r,s 0-Ogq)/q 
by Lemma 4.5[ this bound also holds if the denominator d of s is not a prime power, since 
thenA(d)/0(d) = 0. 

• If r is a positive integer and s is an integer, then ri = r for all but finitely many q, in which 
case A(ri)/ri = A(r)/r. Otherwise A(ri)/ri <^r,s 0-Ogq)/q by Corollary 4.8, this bound 
also holds if r is not a prime power, since then A(r)/r = 0. 

Similarly, if r = 1/6 is the reciprocal of a positive integer and ^ = 6s is an integer, then 
6(r + sq) = 1 + {bs)q = 1 (mod g); moreover, 6 will be the smallest positive integer (for all 
but finitely many q) such that 6(r + sq) = 1 (mod q), and so A(r2)/r2 = A(6) /6. Otherwise 
A(r2)/r2 <^r,s (logg)/g by Corollary 4.8, this bound also holds if the reciprocal 6 of r is 

0. Note also that if 6 is a prime power, then the 

sq are both integers. 



not a prime power, since then A{b)/b 
denominator of s must be the same prime power, since 6s and r 



• Corollary 4.1 1 tells us exactly when we have a contribution from H{q; r + sq, 1) other than 
the error term Or,<;((logg)/g): the denominator of s must be a prime power, and r must 
be either a power of the same prime or else the reciprocal of a smaller power of the same 
prime. 

In summary, there are six situations in which there is a contribution to A(g; r + sq, 1) beyond the 
error term Or,s((logg)/g): four situations when the denominator of s is a prime power and two 
situations when s is an integer. All six situations are disjoint, and the contribution to A(g; r + sq, 1) 
in each situation is exactly R{r, s) as defined in Definition 4.1 This establishes the theorem. □ 



4.3. The Bays-Hudson "mirror image phenomenon". In 1983, Bays and Hudson [2J pub- 
lished their observations of some curious phenomena in the prime number race among the reduced 
residue classes modulo 11. They graphed normalized error terms corresponding to 7r(x; 11, 1), 
. . . , 7t(x; 11, 10), much like the functions E{x; 11, a) discussed in this paper, and from the graph 
they saw that the terms corresponding to the nonsquare residue classes tended to be positive, 
while the terms corresponding to the square residue classes tended to be negative, as Chebyshev's 
bias predicts. Unexpectedly, however, they noticed [|2l Figure 1]) that the graph corresponding to 
7r(x; 11, 1) had a tendency to look like a mirror image of the graph corresponding to 7r(x; 11, 10), 
and similarly for the other pairs 7r(x; 11, a) and 7r(x; 11, 11 — a). They deemed this observation the 
"additive inverse phenomenon"; we use the physically suggestive name "mirror image phenome- 
non". 

This prompted them to graph the various normalized error terms corresponding to the sums 
7i{x; 11, a) + n(x; 11, 6) where a is a nonsquare (mod 11) and 6 is a square (mod 11); all such 
normalized sums have the same mean value. They witnessed a noticeable difference between the 
cases a + b = 11, when the graph corresponding to the sum was typically quite close to the average 
value (as in [|2l Figure 2]), and all other cases which tended to result in more spread-out graphs. 

The ideas of the current paper can be used to explain this phenomenon. We consider more 
generally the limiting logarithmic distributions of the sums of error terms E(x; q, a) + E(x; q, 6), 
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where a is a nonsquare (mod q) and 6 is a square (mod q). The methods of Section 2.1 are easily 
modified to show (under the usual assumptions of GRH and LI) that this distribution has variance 



V^{q-a,h)= J2 \x{a) + x{b)\'b{x). (4.4) 



X (mod q) 



Following the method of proof of Theorem 1.4 one can show that for any modulus q and any pair 
a, b of reduced residues modulo q, we have 

\ogp A{q) , 1 o A 

(7o + log27r) 



V+(g;a,6) = 20(g)(logg-5^^ 



<P{q) 



- K,{a -h) - i,{-ah~^)\og'2}j + 2M+(g; a, 6) - 46(xo), (4.5) 



where 



^ L(l,x*) 



X (mod q) 

In particular, we note the term —iq[—a\r^) log 2; many of the other terms vanish or simplify in 
the special case that q is prime. We also note that the primary contribution to M'^{q; a, h) is the 
expression — A(ri)/ri — A(r2)/r2, where ri and r2 are the least positive residues of ah^^ and 
ha^^ (mod q). Both of these expressions are familiar to us from our analysis of V{q] a, h), although 
their signs are negative in the current setting rather than positive as before. 

We see that the variance V+(g;a,6) of this distribution E{x;q,a) + E(x;q,b) is somewhat 
smaller than the typical size if there is a small prime congruent to ab~^ or ba^^ (mod g); more 
importantly, it is smallest of all if —ab^^ = 1 (mod q), which is precisely the situation a + b = q. 
In other words, we see very explicitly that the cases where a + b = q yield distributions with 
smaller-than-normal variance, as observed for q = 11 by Bays and Hudson. In particular, our 
theory predicts that for any prime q = 3 (mod 4) (so that exactly one of a and —a is a square), the 
graphs of E(x; q, a) and E{x]q,q — a) will tend to resemble mirror images of each other, more 
so than the graphs of two functions E{x; q, a) and E{x; q, b) where a and b are unrelated. On the 
other hand, the contribution of the Lg term is in a secondary main term, and so the theory predicts 
that this mirror-image tendency becomes weaker as q grows larger. 

We can use the numerical data in the case q = 11, computed first by Bays and Hudson, to 
confirm our theoretical evaluation of these variances. We computed the values of each of the 
twenty-five functions E(x; 11, a) + E(x; 11, b), where a is a square and b a nonsquare (mod q), 
on 400 logarithmically equally spaced points spanning the interval [10^, 10^]. We then computed 
the variance of our sample points for each function, in order to compare them with the theoretical 



variance given in equation ( |4.5[ ), which we computed numerically. It is evident from equation ( |4.4| ) 
that multiplying both a and b by the same factor does not change V~^{q; a, b), and therefore there 
are only three distinct values for these theoretical variances: the functions E(x; q, a) + E(x; q, b) 
where a + b = 11 all give the same variance, as do the functions where ab^^ = 2 or ab~^ = = 
6 (mod 11), and the functions where ab^^ = 7 or ab^^ = 7^^ =8 (mod 11). Table |6] summarizes 
our calculations, where the middle column reports the mean of the variances calculated for the 
functions in each set. 
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Table 6. Observed and theoretical variances for E{x; 11, a) + E{x; 11, b) 



Set of functions 


Average variance calculated 


Theoretical 


E{x; 11, a) + E{x] 11, b) 


from sampled data 


variance 


a + b = 11 


5.60 


5.31 


{ab~\ba-^} = {2,6} (mod 11) 


7.10 


6.82 


{ab-\ba-^} = {7,8} (mod 11) 


9.59 


9.06 



Looking directly at the definition ( |4.4| ) of V^{q; a, b), we see that when a = —b (mod q), the 
only characters that contribute to the sum are the even characters, since we have x(a) + xib) = 
x{a) + x(— l)x(a) = when x(— 1) = —1. As seen earlier in Lemma 3.5 the quantity b{x) is 
smaller for even characters than for odd characters, which is another way to express the explanation 
of the Bays-Hudson observations. 



5. Explicit bounds and computations 

We concern ourselves with explicit numerical bounds and computations of the densities 5{q; a, b) 
in this final section. We begin in Section 5.1 by establishing auxiliary bounds for V{z), for h-{s, x), 



and for the number of zeros of L(s, x) near a given height. In Section 5.2 we use these explicit 
inequalities to provide the proofs of two propositions stated in Section |3.3[ we also establish com- 
putationally accessible upper and lower bounds for the variance V{q; a, b). Explicit estimates for 



the density 5{q; a, b) are proved in Section 5.3[ including two theorems that give explicit numerical 



upper bounds for 5{q; a, b) for q above 1000. Finally, in Section 5.4 we describe the two meth 



ods we used to calculate numerical values for 5{q; a, b); we include some sample data from these 
calculations, including the 120 largest density values that ever occur. 

5.1. Bounds for classical functions. The main goals of this section are to bound the number of 
zeros of L{s, x) near a particular height and to estimate the size of j^{s, x) inside the critical strip, 
both with explicit constants. To achieve this, we first establish some explicit inequalities for the 
Euler Gamma-function. 



Proposition 5.1. IfRez> |, then 

logr(2;) - 

and 

T'{z) 



1\ 1 
z - ^yogz + z - - \og2'n 



< 



4kl 



r ^) 



- \og{z + 1) + 



2z 



1 1 

"+2 ^ ^ 



< 0.2. 



Proof. The first inequality follows from [9, equations (1) and (9) of Section 1.3], both taken with 
n = 1. As for the second inequality, we begin with the identity [.16. equation (21)], taken with 
a = 1: 



"^{z + l) = log(z + 1) - 



2(^ + 1) 



Here "^{z) 



jriz) has its usual meaning; we use the identity 



L'-^I-.V to obtain 
r{z+i) 



1 
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and therefore it suffices to show that < 0.2 when Re (2;) > |. The notation fi{z) = 

logFi_i/2(2;) is defined in [ll6i equation (9)], and therefore f[{z) = F[ ^i^{z) / Fi^i/2{z). By [fT6l 



Lemma 1.1.1], the denominator -Fi 1/2(2) is bounded below in modulus by a/c/tt; by [16, Lemma 
2.2.1] taken with a = n = 1, the numerator is bounded above in modulus by 

X + 1 1 

x + 1/2 " 2x + 2' 



I F' 

r 1,1 



/2\ 



< log 



Re z (unfortunately fVB, equation (27)] contains the misprint /f" ^2 where -F|"/2 



where x 

intended). The right-hand side of this inequality is a decreasing function of x, and its value at 
X = I is log I — |. We conclude that for Re z > |, we have \ f'i{z)\ < ( log | — |) / v^c/tt < 0.2, 
as needed. □ 

Lemma 5.2. Let a = Q or a = 1. For any real numbers 7 < cr < 1 and T, we have 



T'iUa + tT + a)) T'a{2 + tT + a)) 



TiUa + tT + a)) 



r(|(2 + zT + a)) 



< 7.812. 



(5.1) 



Proof. By symmetry we may assume that T > 0. We first dispose of the case T < 3. When a = 0, 
a computer calculation shows that the maximum value of the left-hand side of equation ( |5.1| ) in 
the rectangle {a + iT : \ < a < 1, < T < 3} occurs at a = | and T = 0: the value of the 
left-hand side at that point is a bit less than 7.812. When a = 1, a similar calculation shows that 
the left-hand side of equation ( |5.1[ ) is always strictly less than 7.812. 

For the rest of the proof, we may therefore assume that T > 3. By Proposition |5T| 

r'(|(or + ir + a)) r'(i(2 + 2T + a)) a + iT + a + 2 1 2 

log 7^ 



r(|(a + ir + a)) Ta{2 + iT + a)) 



and therefore 



T'ilia + iT + a)) 
T{Ua + tT + a)) 



log 



r'(^(2 + zT + a)) 



A + iT + a 



+ 



1 



A + iT 



a + iT + a + 2 a + iT + a 
+ 0(0.4), 



2 + iT + a 



+ 



T{^{2 + tT + a)) 
2 



< 


log 







a 



a 



A + iT + a 
2 



2- a 



[a + iT + a + 2){A + iT + a 

Under the assumptions on a, a, and T, we always have the inequality 
The maximum modulus principle implies the inequality I - log(l — z) 



+ 0.4. 



a + iT + a){2 + iT + a 

2-a/{A + iT + a)\ < \ 



< log 4 for \z\ < \, and 



so 



r'(^(^ 



iT + a)) 



V'a{2 + iT + a)) 



iT + a)) 



+ 



T{\{2 + iT + a)) 
2 



< 



a 



A + iT 



log 4 



a 



a 



{a + tT + a + 3){5 + tT + a 
Finally we use the inequalities on cr, a, and T to conclude that 



{a + iT + a){2 + iT + a) 



+ 0.4 



r'(|(a + zT + a)) VaC^ + tT + a)) 



T{l{a + iT + a)) r(|(2 + zT + a)) 
which amply suffices to finish the proof. 



+ 



2 2 
<-log4+^ 
~ 5 5^13 3V13 
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+ 0.4 < 1.4353, 



□ 



We turn now to estimates for quantities associated with Dirichlet L-functions. The next few 
results do not require GRH to be true, and in fact their proofs cite identities from the literature that 
hold more generally no matter where the zeros of L(s, x) might lie. Accordingly, we use the usual 
notation p = /3 + i'j to denote a nontrivial zero of L(s, x), and all sums in this section of the form 
J2p denote sums over all such nontrivial zeros of the Dirichlet L-function. 

Lemma 5.3. Let q > 2, and let xbe a nonprincipal character (mod q). For any real number T, 

1 1 



< 



2 + iT-p|2 2 



log (0.609g(|T| + 5)). 



Proof. It suffices to prove the lemma for primitive characters. For x primitive, it is known [[TTl 
equation (10.37)] that as meromorphic functions on the complex plane. 



L'{s,x) 
L{s,x) 



1 



s + a)) 



1, 

2 °^7r 2Ta{s + a)) 



+ B{x) + Yl 



1 



(5.2) 



where the constant B{x) was described earlier in the proof of Lemma [33} and where a = if 

= 1 and a = 1 if = 
we obtain after rearrangement 



1 



1. Taking real parts of both sides and using the identity p.3| ), 

(5.3) 



s-p L{s,x) 2 ^TT 2 Tilis + a)) 



If we put z = I (s + a) in Proposition 5. 1 , we see that for Re s > |, 

1 

Re 



2 ^ 



Re -#±4^ 



Re log 



s + a + 2 



s + a + 2 



Re 



s + a 



< log|s + a + l| -log2 + + 0.2 < log|s + 3 
Inserting this bound into equation ( |5.3[ ) and putting s = 2 + iT, 

1 



2 + iT 



L'{2 + tT,x) , 1, g^l, 1,^.^1 
- < Re . + - log - + - log 5 + tT\ 

p L{2 + tT,x) 2 vr 2 



0.2 

- 0.493. 

- 0.246. 



Now notice that 



L'{2 + tT,x) 



L{2 + tT,x) 



n=l 



x(n)A(n) 



n 



2+iT 



< 



E 

n=l 



A(n)_ C{2) 
n2 C(2) 



< 0.57, 



(5.4) 



and therefore 

ReE 



1 



2 + iT -p 



0.246 
1 



< 0.57+^ log- + Jlog|5 + zT| 

2 TT 2 

< llogg+^log(|T| + 5) + 0.57 

< ^log(g(|T|+5)) -0.248 < ^log (0.609g(|T| + 5)). 
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logTT - 0.246 



We obtain finally 



|2 + 2T-pP 



< 



2-/3 
\2 + tT-p\^ 



E 2 + ,T-p - ^ (0-609g(|r| + 5)) 



as claimed. 

Proposition 5.4. For any nonprincipal character x <^nd any real number T, we have 

#{p: |T - Imp| < 2} < 41og (0.609g(|T| + 5)). 

Proof. This follows immediately from equation ( |5.5[ ) and the inequalities 

1 , 2-/3 



(5.5) 

□ 



|T-7l<2 



(2 - a)2 + (T - 7)^ 



< 



|2 + zr-p|^ 



□ 



Lemma 5.5. Let s = a + iT with | < cr < 1. For any primitive character x (mod q) with q > 2, 
if Lis, x) 7^ then 

^^7^- y — < V21og(0.609g(|T|+5))+4.48. 

|r-7l<2 

Proof. Applying equation ( |5.2[ ) at s = a + iT and again at 2 + iT, we obtain 

L'{s,x) L'{2 + tT,x) _ i r(l(2 + 2T + a)) i T'(l{s + a)) 
L{s,x) L{2 + iT,x) 2r(|(2 + zT + a)) 2T{\{s + a)) 

which implies 



+E 



1 1 

s — p 2 + iT — p 



E 



1 



|r-7l<2 



< 



L'{2 + iT,x) 



L{2 + tT,x) 



|r-7l>2 



r'(i(2 + zT + a)) r(|(s + a)) 



r(^(2 + ^r + a)) r(i(s + a)) 

^ ' E 

|r-7l<2 



s — p 2 + iT — p 



\2 + iT-p\' 



Using equation ( |5.4[ ) and Lemma 5.2 to bound the first two terms on the right-hand side, we see 
that 



L'{s,x) 



E ^ 



\T-l\<2 



< 0.57 + 3.906 



+ E 



2- a 



+ E 



|T-7l>2 



\s-p\\2 + tT-p\ ' ^ \2 + zT-p\ 

|T-7|<2 
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. (5.6) 



To prepare the last two sums for an application of Lemma|5.3[ we note that when |T — 7I > 2, 

2- a 



< 2 



\s — p\\2 + iT — p 
on the other hand, when |T — 7I < 2, 

1 



\2 + iT - 


- P\ 


1 




s- p 




\2 + iT - p 


2 



< 2V2- 



1 



\2 + iT- p\ 



< 



2V2 



2 + tT - p\ 



12 ■ 



\2 + iT-p\ \2 + iT-p\^ \2 + iT-p\ 
Therefore equation ( |5.6[ ) becomes, by Lemma 5.3 , 

^'^"'^^ y — < 0.57 + 3.906 + 2v^y- 



Lis,x) 



\T-l\<2 



< 



4.48 + y21og (0.609g(|T| + 5)) 



as claimed. 



□ 



We restore the assumption of GRH for the last proposition of this section, which is used in the 
proof of Lemma 5.10 below. 

Proposition 5.6. Assume GRH. Let s = a + iT with \ < c < 1, a ^ \. If x i^ <^ny nonprincipal 
character (mod q), then 

L'is,x) 



L{s,x) 



^ (^3^ + ^) (0-609g(|T| + 5)) + 4.48 + 



Furthermore, ifx is primitive and q > 2, then the summand (log q) / {2'^ — 1) can be omitted from 
the upper bound. 



Proof. Assume first that x is primitive. Lemma 5.5 tells us that 



L'{s,x) 



L{s,x) 



< ^ ^ +y21og(0.609g(|T| +5)) +4.48 



|T-7l<2 



\s- p 



< 1 n Hp- |T - 7I < 2} + v^log (0.609g(|T| + 5)) + 4.48 



under the assumption of GRH; the proposition for primitive x iiow follows immediately from 
Proposition |5.4[ 

If X is not primitive, then L{s,x) = L{s,x*) Y[p\q (l ~ ^^^) ' ^^^^ have the identity 

L'{s,x) ^ L'{s,x*) ^ sp X*{p)^ogP 

L{s,x) L{s,x*) I 

p\q 



Therefore 



L'{s,x) L\s,x*] 



L{,s,x) L{s,x*) 



<y^<^yiogp<^. 

-A^ per _i-2<^-l^ - 2^" - I 

p\q p\q 



which finishes the proof of the proposition in full. 
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□ 



5.2. Bounds for the variance V{q; a, h). This section has two main purposes. First, we provide 
the proofs of Propositions 3.10 and 3.11, two statements involving smoothed sums of the von 
Mangoldt function which were stated in Section |3.3[ Second, we establish two sets of upper and 
lower bounds for the variance V{q] a, h), one when q is prime and one valid for all q. All of these 
results are stated with explicit constants and are valid for explicit ranges of q. 

Lemma 5.7. For any real number t, we have \-j^\V (^—]^ + it)\ \ < \V'{—^+it)\. 

Proof. We show more generally that if f{t) is any differentiable complex- valued function that 
never takes the value 0, then |/(t)| is also differentiable and | ;|: 1/(^)1 1 < \ f'{t)\ \ the lemma then 
follows since V never takes the value 0. Write f{t) = u{t) + iv{t) where u and v are real-valued; 
then 

\/u[tY + v[tY 

while \ f'{t)\ = \u'{t) + iv'{t)\ = ^Ju'{tY + v'{ty. The asserted inequality is therefore equiva- 
lent to \u{t)u'{t) + v{t)v'{t)\ < \/u{tY + v{tY ^^u' [ty + v'{ty, which is a consequence of the 
Cauchy-Schwarz inequality. □ 

Lemma 5.8. We have \T{s)\ < \T(Res)\ for all complex numbers s. 

Note that this assertion is trivially true if Res is a nonpositive integer, under the convention 

\T(-n) \ = ooforn > 0. 



Proof. We prove that the assertion holds whenever Re s > —n, by induction on n. The base case 





n = can be derived from the integral representation r(s) = *dt, which gives 



f oo 

|r(s)| < / \f-^\e~'dt= I t^'''-^e-'dt = TiRes). 
Jo Jo 

Now assume that the assertion holds whenever Re s > —n. Given a complex number s for which 
Re s > —{n + 1), we use the identity T{s + 1) = sT{s) and the induction hypothesis to write 

ir(,)| = Ml±M < . ^ir(Re.)| < |r(Re.)|. 

as desired. □ 
Lemma 5.9. For any nonprincipal character x, 



5Z |r(-^ +^7)1 < 14.271ogg + 16.25. 



7eiR 

L(l/2+i7,x)=0 



We remark that this lemma does not assume GRH, since the sum on the left-hand side only de- 
creases if some of the zeros of L(s, x) lie off the critical line. 



Proof. First, by Proposition 5.4 applied with T = 0, the number of zeros of L(s, x) with I7I < 2 



is at most 41og(3.045g); thus by Lemma [5^ 



E |r(-i + n)l<|r(-i)| E i 

l7l<2 |7l<2 
L{l/2+i7,x)=0 I,{l/2+i7,x)=0 



< 8v^log(3.045g) < 14.18 log g + 15.79. (5.7) 
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We can write the remainder of the sum using Riemann-Stieltjes integration as 

/■oo 

E |r(-^ + ^7)| \T{-l+tt)\d{Nit,x)-Ni2,x)) 



l7l>2 
L(l/2+i7,x)=0 



{N{t,x)-N{2,x))f^\r{-l + it)\dt; 



the vanishing of the boundary terms is justified by the upper bound N(t, x) t log t (see Propo- 
sition 2.15|for example) and the exponential decay of r(s) on vertical lines. We conclude from 



Lemma that 



/■oo 

J2 |r(-i + ^7)| <y^ Nit,x)\r'{-'^ + tt)\dt 



l7l<2 
L{l/2+i7,x)=0 



< 



- + 0.68884 ) log — + 10.6035 ) |r'(-i + it) I dt 
TT / 27re ' ' ^ ^ ' 



by Proposition 2.15 Since log(gt/27re) = log g + log(t/27re), the right-hand side is simply a linear 
function of log q; using numerical integration we see that 



Yl |r(-| +i7)| < 0.091ogg + 0.46. 



l7l<2 
L(l/2+i7,x)=0 



Combining this upper bound with the bound in equation ( |5.7| ) establishes the lemma. 
Lemma 5.10. Assume GRH. For any nonprincipal character x, 

L'{s + l,x)- 



□ 



-3/4+joo 



3/4-joo 



L{s + l,x) 



T s) 



ds < 101 log g+ 112. 



Proof. Proposition 5.6 with cr = j tells us that for any real number t. 



L'{l+tt,x) 



L{\+it,x) 



< 17.42 log (0.609g(|t| + 5)) + 4.48 + 

< 22.711ogg + 17.42 log(|t| +5) -4.159, 



and therefore 



-3/4+ioo 



-3/4-ioo 



L'{s + l,x) 
L{s + l,x) 



Tis] 



ds< (22.711ogg+17.421og(|t| + 5)-4.159)|r(-| + it)| rft. 



Again this integral is a linear function of log q, and a numerical calculation establishes the partic- 
ular constants used in the statement of the lemma. □ 

With these lemmas in hand, we are now able to provide the two proofs deferred until now from 
Section |33l 



Proof of Proposition 3.10 We begin with the Mellin transform formula, valid for any real number 

c> 0, 



E 

n=l 



n 2'Ki 



c+ioo T I 



L\s + l,x) 
L{s + l,x) 



T{s)y' ds 
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(see [fm equations (5.24) and (5.25)]). We move the contour to the left, from the vertical line 
Re s = c to the vertical line Re s = — |, picking up contributions from the pole of F at s = as 
well as from each nontrivial zero of L{s, x). The result is 



n=l 



^ n L(l,x) K 2 fjy 



/2+iy 



7ei 

L(l/2+i7,x)=0 



+ 



1 r'^/^-^'^ L'{s + i,x) 



27ri i_3/4_ioo L{s + l,x) 



T{s)y'ds (5.8) 



since we are assuming GRH. (Strictly speaking, we should consider truncations of these infinite 
integrals; however, the exponential decay of T(s) in vertical strips implies that the contributions at 
large height do vanish in the limit.) 

The sum on the right-hand side can be bounded by 

L(l/2+j7,x)=0 L(l/2+i7,x)=0 

by Lemma 15^ while the integral can be bounded by 



y 



1/2 



1 /-3/4+*°o 2,^(3^ 1^^) 

L{s + l,x) 



2ni 



3/4-400 



T{s)y' ds 



< 



1 



— 3/4+ioo 



3/4-joo 



< 



27r?/3/4 



101 log g + 112 

27ry3/4 



L\s + l,x) 



Lis + l,x) 



Tis] 



ds 



by Lemma 5.10 Using these two inequalities in equation (|5.8[) establishes the proposition. □ 



Proof of Proposition 3.11 Since 1 < a < g, we may write 

A(n) „/g2 _ A(a) „/g2 



E 



+0 



n=a (mod q) 



n 



q<n<q'^ 
n=a (mod q) 



n>q'' 



(5.9) 



n=a (mod q) 

Since A{n)/n < (log n)/n, which is a decreasing function of n for n > 3, we have 



A{n) 



n 



21ogg _i 1 

— 7. — e 



fc=0 



1 -e-1/5' 



n=a (mod g) 

note here that 1 < a < g so g > 2. As the function tj (1 — e^*) is bounded by 1/2(1 — e^^/^) for 
< t < |, we conclude that 



AH^-„/g2 . 21ogg 1 1_ 



logg 



n>q 



< -^^^e-'— — TTTv < 0.935 

n q 2(1 — e ^/"') g 

n=a (mod g) 

We bound the second term of equation ( |5.9| ) crudely: 

A(n 

n - - - - QT + a a J 



E 



q<n<q^ 
n=a (mod g) 



2nV^ 1 ^ 21ogg 1 21ogg 

< logg V . , < V- < logg + 1 . 

j-^ qj + a (1 
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Finally, for the first term of equation ( |5.9| ), the estimate e * = 1 + 0(t) for t > allows us to write 



A(a) 



'O-lq 



A(a) 



1 + 



A(a) 



+ 



logg 
9 



Using these three deductions transforms equation ( |5.9[ ) into the statement of the proposition. □ 

We now turn to the matter of giving explicit upper and lower bounds for a, 6). In the case 
where q is prime, we are already able to establish such estimates. 

Proposition 5.11. If q> 150 is prime, then 

2(g - 1) (log q - 2.42) - 47.238 log^ q < V{q; a,b) <2{q- 1) (log q - 0.99) + 47.238 log^ q. 

Proof. Combining Theorem [L4 with Proposition 3.12[ we see that 

V{q; a, b) = 20(g) (£(g) + Kg{a - b) + Lgi-ab"^) log 2) + 2M*(g; a, b) 

= 2ct>{q) (c{q) + Kg{a - b) + i,{-ab-^) log 2 + + ^ + Ho{q; a, b) 



r2 



+ 



47.2380(g) log^ g 



(5.10) 



for any g > 150, where ri and r2 denote the least positive residues of ab ^ and ba ^ (mod g). 
Since we are assuming g is prime, both Kg{a — b) and HQ^q; a, b) vanish, and we have 



V{q-a,b) = 2{q-l)[ log 



Hri) , A(r2)\ 



, 6,(-ar^)log2 + :^ + :^) +0(47.2381og^g). 

The function A{n)/n is nonnegative and bounded above by (log3)/3, and the function tg takes 
only the values and 1 ; therefore the quantity in large parentheses satisfies the bounds 



logg-2.42<log;^ + .,(-ari)log2 ' ^^''^ ' ^^''^ 



ri 



r2 



< log g- 0.99, 



27reTo 

which establishes the proposition. □ 
We require two additional lemmas before we can treat the case of general (possibly composite) g. 



Lemma 5.12. With Hq defined in Definition 3.8 we have — (41ogg)/g < HQ{q; a, b) < A.hQ for 
any reduced residues a and b (mod g). 



Proof. Since e(g; p, r) > 1 always, we have 

1 logp 



ho{q;p,r) 



^ 1 logp ^ ^ logp 



p^ 



Therefore 



Ho{q;a,b) <^{hoiq;p,ab ^)+ho{q;p,ba ^))<8^ 



p\q 



logp 

p2 



< 



n=l 



A{n) 



C(2) 
C(2) 



< 4.56, 
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which establishes the upper bound. On the other hand, note that p'^(9;p,i) ^n integer larger than 1 
that is congruent to 1 (mod q/p"). Therefore p'^'^'^'^'^) > q/p'' + 1, and so 



H,{q, a, 6) > -2 5^ ho{q, P, 1) = -2 "7^ 



logp 



^(pv\ pe(q;p,l) _ I 
p\q p\q 



^ logp 4^ 41ogg 

> -2 > TT^—, — > > logp> 

- ^ p-{l - 1 p) q p- - q^ 

p\q p\q 



which establishes the lower bound. □ 
Lemma 5.13. Ifq>2is any integer, then 

y^\ogp_ ^ 1 02 log log g + 3.04. 
^ p-l 

p\q 

Proof. We separate the sum into two intervals at the point 1 + logg. The contribution from the 
larger primes is at worst 

logp ^ 1 >r^, ^ logg 

V 7 < ] > logp < :j = 1. 

^-^ p — 1 log q ^-^ log q 

p\q p\q 
p>l+logiJ 

For the smaller primes, recall the usual notation = J2p<t logP- We will use the explicit bound 
9{t) < 1.01624t for t > from Theorem 9 of [[T3l , and so the contribution from the smaller 
primes is bounded by 

p-i - p-i J t-i 

p\q ^ p<l+logg^ 



^(1 + iogg)^ z-^+^-g^ ejt) 



p<l+logg 

/1 + logg /•^+'°s<? 
< 1.01624 — ^+/ — — 

V log? J2 (^-1. 

= 1.01624(1 + + loglogg- + 1 

V log Q log q 

= 1.01624 log log g + 2.03248, 
which finishes the proof of the lemma. □ 
Proposition 5.14. Ifq> 500, then 

20(g)(logg- 1.02 log log g- 7.34) < V{q;a,b) < 20(g)(logg + 6.1). 
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Proof. We begin with equation ( |5.10| ), expanding the functions C and Kq according to Defini- 
tion O 

\ogp A{q/{q,a-b)) 



V(4ia,6) = 20(,)(log^-^^ 



- - b)) 



+ Lq{-ab-')\og2 + + + Ho{q;a,b) + 0[ — — ]. (5.11) 

ri ra \ Q J J 

The last term on the first line is nonnegative and bounded above by log 2, while the first three 
terms on the second line are nonnegative and bounded together by log 2 + | log 3 as in the proof 
of Proposition |5?TTj The term HQ{q; a, b) is bounded above by 4.56 and below by (—4 log q)/qhy 
Lemma l5.12[ Therefore 

2m ( log , - log 2.e- - E ^ - + ) < V(,i a. i) 

< 20(g) ^logg - log 27re^° + log2 + log2+ ^ logs + 4.56 + 1^ ^^'^^^^"^ (5.12) 



The sum being subtracted on the top line is bounded above by 1.02 log log g + 3 . 04 by Lemma 5.13 
Lastly, a calculation shows that the O error term is at most 1.83 for q > 500, and therefore 

20(g) 1^ log g- log 27re^« - (1.02 log log g + 3.04) - - 1.83^ < V{q;a,b) 

< 20(g) ( log g - log 27re^° + log 2 + log 2 + | log 3 + 4.56 + 1.83) , 
which implies the assertion of the proposition. □ 

5.3. Bounds for the density 6{q; a, b). We use the results of the previous section to obtain explicit 
upper and lower bounds on S{q; a, b); from these bounds, we can prove in particular that all of the 
largest values of these densities occur when the modulus g is less than an explicit bound. In the 
proof of Theorem 1 1.1 [ we expanded several functions, including an instance of sin, into their power 
series at the origin. While this yielded an excellent theoretical formula, for numerical purposes we 
will take a slightly different approach involving the error function Erf (2) = ^ e"* dt. The 
following two lemmas allow us to write the density 5(g; a, b) in terms of the error function. 

Lemma 5.15. For any constants f > and p, 



J-o. t 

Proof. For the first identity, a change of variables gives 

3%/2^ 



V 



5/2 



by Lemma [2!2T| Our starting point for the second identity is [[B equation (7.4.6)]: for any constants 
a > and x. 



00 1 / 







2 V a 
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which can be rewritten as 



^e^^'' = J e""*' cos{2xty/a) dt. 
Integrating both sides from x = Oto x = w yields 

^ Erf(^) = r e--^ ( r cos(2xtv^) dx] dt = H ,-at^'^I}iMM dt 
(the interchanging of the integrals in the middle expression is justified by the absolute convergence 



of the integral). Setting a = ^ and w = -§=, we obtain 

' <^ Z \/2v 



^2v \V2v/ J^oo tV2v 
which establishes the lemma. □ 

Lemma 5.16. Assume GRH and LI. Let a be a nonsquare (mod q) and b a square (mod q). If 
V{q; a, h) > 531, then 



2 2 \^2V{q-aM 



+ O ,\4 + 0.03506—-— + 63.68p(g)e- 



V(g;a,6)i/2/2 



Proof. From Definition 2.1 1 we know that 

log'l',;,,^^ = 2^ 2^ log Jo ( ^^^- ^-11^' j 



X (mod q) 7>0 ^ A/ I + 7^ 

L(l/2+i7,x)=0 V ^ 



2 

X (mod (j) 7e 

L(l/2+i7,x)=0 



by the functional equation for Dirichlet L-functions. If |x| < \, then the argument of Jq is at most 
2-2 - = 2 in absolute value. Since the Taylor expansion log J{){x) = — + 0(. 0311a;'*) is 
valid for I x I < 2, we see that 



X (mod q) "i& 

L(l/2+i7,x)=0 



(1+7^)^ 



2^^ 5Z 5Z 1 1^2 

X(modg) 7eM 4 ^ ' 

L(l/2+i7,x)=0 



,0(|.^E E .03nl^4fl^) 

^ X(modg) 7ell 4V4 "r f 7 / 

L{l/2+i7,x)=0 

a, h)x'^ + 0(39.81\/(g; a, 6)x*) (5.13) 
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when \x\ < |. Moreover, the error term in the expansion log Jo(a;) = —x'^/A + O(.0311a;^) is 
always nonpositive as a consequence of Lemma 2.8 'c), and hence the same is true for the recently 
obtained error term 0(39.81V(g; a, b)x'^). This knowledge allows us to use the expansion e* = 
1 + 0{t) for t < 0, which yields 



when \x\ < 7. 



Proposition 2.18 says that when V{q; a, b) > 531, 



1 1 /•^(^^"''')"''* sinp(g)x^ , , ^ 
S{q; a,b) = - + — / — ^^$<?;a,6(a;) dx 

2 Z-K J-V{g;a,b)-y^ ^ 



/ „~9.089i((?) 

+ 0.03506 , , + 63.67p(g)e-^(^''''^)'^'/2 



Notice that V{q] a, b) ^/'^ < 531 < \, and so we may use our approximation for <l>q.a to 
deduce that 



5(g;a,6) = - + — r^''^^'^ ' sinp(g)x ^_^(^^, ^ 0(39.8iy(g; a, b)x^)) dx 

2 27r i_V'(g;a,6)-i/4 X 



/ „-9.08(/)(g) 

+ O 0.03506^—— + 63.67p(g)e-^(^'"''')'^'/2 



. (5.14) 



The main term can be evaluated by the second identity of Lemma 5.15 

v{,;a,i>)-y^ sinp(g)x 



1 

2^ 



y(g;a,6)-l/4 X 

-j^ /"OO 



27r 



sin p{q)x 



/I r"^ 


sin p(g)x 


Jv{q;a,b)--'/^ 





^Erf f^J^ 
2 Vv/2V%M) 



-V(g;a,fe)x2/2 ^, 



y(g;a,6)-l/4 
P(g) ^-\/(g;a,b)i/V2 



7rV(g;a,6)3/4 



The error term in the integral in equation ( 5.14 1 can be estimated by the first identity of Lemma 5.15 





|.V{g;a,fe)-i/4 


sin p(g)x 


27r, 


'-V(g;a,b)-i/4 


X 



/OO 
^4g-y(<7;a,6)xV2 < 19.008\/2^ • p{q)V [q] a, 6)" 
-OO 



■3/2 



68 



Therefore equation (|5.14|) becomes 



5{q;a,b) 



1 



Erf 



V^V{q;a,b) 

47.65p(g) 



O 



7TV{q; a.hfl^ 



-V'(g;a,fe)i/2/2 



+ 



+ 0.03506- 



-9m<t>(q) 



+ 63.67p(g)e 



-V(g;a,6)i/2/2 



V(g;a,6)3/2 ' 0(g) 

Since l/7rV^(g; a, 6)^/^ < l/7r(531)^/'' < 0.01, this last estimate implies the statement of the 
lemma. □ 

We are now ready to bound 5{q] a, h) for all large prime moduli q. 

Theorem 5.17. Assume GRH and LI. Ifq > 400 is prime, then 6{q; a, b) < 0.5262 for all reduced 
residues a and b (mod q). Ifq > 1000 is prime, then 6{q; a, b) < 0.51. 

Proof. We may assume that a is a nonsquare (mo d q) a nd 6 is a square (mod q), for otherwise 
S{q; a,b) < ^. When q > 331 is prime, Proposition 5.11 and a quick calculation yield 

V{q; a, b) > 2{q - l)(log q - 2.42) - 47.238 log^ q > 2g(log q - 2.42) - 48 log^ g > 531. (5.15) 

2 and 0(g) = g — 1) 



Therefore Lemma 5.16 applies, yielding (since p(g) = 

95.3 



6{q;a,b) 




y{q;a,b)y^ 



+ 0.03506 



g-1 



+ 127.36e-^(''^'^'^)'''/2 



+ 



2g(logg - 2.42) -481og^g 
95.3 



-9.08g 



(2g(log g - 2.42) - 48 log^ g)3/2 



0.03506- 



+ 127.36e 



-A/7(logg-2.42)/2-121og2 



using the second inequality in equation ( |5.15[ ). This upper bound is decreasing for g > 331, and so 
calculating it at g = 400 and g = 1000 establishes the inequalities given in the theorem. □ 

A similar bound for composite moduli g requires one last estimate. 

Lemma 5.18. For all q>3,we have p{q) < 2gi 04/iogiogg_ 

Proof. We first record some explicit estimates on the prime counting functions 7r(y) = J2p<y 1 
and 9(y) = J2p<y logP- Rosser and Shoenfeld [1^ Corollary 1 and Theorems 9 and 10] give, for 
y > 101, the bounds 0.84?/ < e{y) < 1.01624?/ and7r(|/) < 1.25506?// logy. Therefore 



, , 1.25506?/ 
7i{y) < — < 



1.25506^(?/)/0.84 l.5e{y) 



< 



(5.16) 



log y - log e{y) - log 1.01624 " log e{y) 
(a calculation shows that the last inequality holds for 6{y) > 61, which is valid in the range 

y > 101). 

Now consider integers of the form q{y) = Y[p<yP^ that ui{q{y)) = Tr{y) and log q{y) = 0{y). 
Equation ( |5.16[ ) becomes uj{q{y)) < 1.5(log q{y))/ log log q{y)\ while the derivation was valid for 
y > 101, one can calculate that the inequality holds for 3 < y < 101 as well. The following 
standard argument then shows that 

1.51ogg 

uj{q) < , — , (5.17) 

log log g 

69 



holds for all integers g > 3: if g has k distinct prime factors, then choose y to be the kth prime. 



Then the inequality ( |5.17[ ) has been shown to hold for q{y), and therefore it holds for q as well, 
since the left-hand side is k in both cases while the right-hand side is at least as large for q as it is 
for q{y). 

(This argument uses the fact that the right-hand side is an increasing function, which holds only 
for q> therefore technically we have proved ( |5.17[ ) only for numbers with at least three distinct 
prime factors, since only then does the corresponding q{y) exceed e^. However, the right-hand side 



of ( |5.17[ ) is always at least 4 in the range g > 3, and so numbers with one or two distinct prime 
factors easily satisfy the inequality.) 

Finally, the inequality p(g) < 2'^*^^^+^ that was noted in Definition 

p{q) < 2i+i-5(iogq)/iogiogg ^ 2gi-04/i°si°g9 for all g > 3, as desired. 



1.2 



allows us to conclude that 

□ 



Theorem 5.19. Assume GRH and LI. If q > 480 and g ^ {840, 1320}, then 6{q; a,b) < 0.75 for 
all reduced residues a and b (mod g). 



Proof. Again we may assume that a is a nonsquare (mod g) and 6 is a square (mod g). 
we restrict to the range g > 260000; by Proposition |5.14 we have V{q;a,b) > 531. 



First 
Using 

Lemma 5.16, together with the upper bound for p(g) from Lemma 5.18 and the lower bound for 
V{q; a, b) from Proposition 5.14 we have 



+ 



2g 



1.04/ log log q 



2y 0(g) ( log g - 1.02 log log g - 7.34) 

33 7gl 04/ log logg 



-9m<j>{q) 



(f)(q)3/2 ( log g - 1 .02 log log g - 7.34) 
+ 127.36gi-°^/i°s^°^''exp(^- '^^'^^ 



+ 0.03506- 



logg — 1.02 log log g — 7.34 



(5.18) 



Rosser and Schoenfeld [|T3l Theorem 15] have given the bound 



0(g) > 



e7o log log g + 2.50637/ log log g 



(5.19) 



for g > 3. When this lower bound is substituted for 0(g) in the upper bound ( |5.18| ), the result 
is a smooth function of g that is well-defined and decreasing for g > 260000, and its value at 
g = 260000 is less than 0.75. 

We now turn to the range 1000 < g < 260000. We first compute explicitly, for each such 



modulus g, the lower bound for V{q; a, b) in equation ( |5.12[ ); the value of this sharper lower bound 



turns out always to exceed 531 in this range. Consequently, we may use Lemma 5.16 together with 
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the lower bound for V{q; a, h) from equation ( |5.12| ), obtaining 



%;a,6)<i + iErf^ ^^^^ 



2 2 V2^0(g)(logg-log2vre7o-E^I^|^-ll^-?M2i2s!l 
17.85p(g) 

+ — + 0.03506—-— 

0(g)3/2(logg - Iog27re7o _ ^ _ IMa _ 2M2i2^)3/2 0(g) 



+ 63.68p(g) exp ^ ^ log g - log 27reTo - ^ 



logp 41ogg 23.62 log^g 



»- 1 g g 



This upper bound can be computed exactly for each g in the range 1000 < g < 260000; the only 
five moduli for which the upper bound exceeds 0.75 are 1020, 1320, 1560, 1680, and 1848. 



Finally, we use the methods described in Section 5.4 computing directly every value of 5(g; a, h) 
for the moduli 480 < g < 1000 and g G {1020, 1320, 1560, 1680, 1848} and verifying the in- 
equality (5(g; a, h) < 0.75 holds except for g = 840 and g = 1320, to complete the proof of the 
theorem. □ 

5.4. Explicit computation of the densities. Throughout this section, we assume GRH and LI, 
and we let a denote a nonsquare (mod g) and h a square (mod g). In this section we describe the 
process by which we computed actual values of the densities (5(g; a, h), resulting for example in 
the data given in the tables and figures of this paper. In fact, we used two different methods for 
these computations, one that works for "small g" and one that works for "large g". For ease of 
discussion, we define the sets 

5i = {3 < g < 1000 : g ^ 2 (mod 4) and 0(g) < 80} 

^2 = {101, 103, 107, 109, 113, 115, 119, 121, 123, 125, 129, 133, 141, 143, 145, 147, 153, 155, 159, 
164, 165, 171, 172, 175, 176, 177, 183, 184, 188, 189, 195, 196, 200, 208, 212, 220, 224, 225, 
231, 232, 236, 255, 260, 264, 276, 280, 288, 300, 308, 312, 324, 336, 348, 360, 372, 396, 420} 

^3 = {3 < g < 1000 : g ^ 2 (mod 4) and 0(g) > 80} \ ^2 

^4 = {1020, 1320, 1560, 1680, 1848}. 

We omit integers congruent to 2 (mod 4) from these sets, since for odd g the prime number race 
(mod 2g) is identical to the prime number race (mod g). 



For the moduli g in the set 5*1 U 5*2, we numerically evaluated the integral in equation ( |2.10| ) 
directly; this method was used by Feuerverger and Martin ||5l and is analogous to, and indeed based 
upon, the method used by Rubinstein and Sarnak [[T4|. We first used Rubinstein's computational 
package Icalc to calculate, for each character x (mod g), the first A^(g) nontrivial zeros of L(s, x) 
lying above the real axis. The term $g;a,6 in the integrand is a product of functions of the form 
F(2;,x), which is indexed by infinitely many zeros of L(s,x); we approximated F{z,x) by its 
truncation at A^(g) zeros, multiplied by a compensating quadratic polynomial as in [14, Section 



4.3]. With this approximation to the integral (2.10), we truncated the range of integration to an 



interval [— C(g) , C(g)] and then discretized the truncated integral, replacing it by a sum over points 
spaced by e{q) as in [1 141 Section 4.1]. The result is an approximation to 5(g; a, h) that is valid up 
to at least 8 decimal places, provided we choose A^(g), C(g), and e{q) carefully to get small errors. 
(All of these computations were performed using the computational software Mathemat ica.) 
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Explicitly bounding the error in this process is not the goal of the present paper; we refer the 
interested reader to lfT4l for rigorous error bounds of this kind, corresponding to their calculation 
of 6{q; N,R) for g G {3,4,5,7,11,13}. 

For the moduli q in the set 5*3 U 5*4 (and for any other moduli larger than 1000 we wished to 
address), we used an approach based on our asymptotic formulas for S{q; a, h). We now outline a 
variant of the asymptotic formulas described earlier in this paper, one that was optimized somewhat 
for the the actual computations rather than streamlined for theoretical purposes. 



We first note that a slight modification of the proof of Proposition 2.18 yields the estimate, for 

_5_ 
24' 



any < K < ^ 



5{q- ^) = 2 + ^ j ^^^q;aA^) dx 

+ Oi^-J p(g)|$,;a,fe(x)| + 0.03506^^ + 63.67p(g)|$,;a,fe(^)|J (5.20) 

as long as V{q] a, b) > 531. In addition we have, for |x| < the inequalities 

- lV{q; a, b)x^ - U{q; a, b)x^ - 15.816f/(g; a, h)x^ 

< log ^q-a,b{x) < -lV{q; a, h)x^ - U {q; a, b)x^, 

where for convenience we have defined U{q; a, h) = W2{q; a, b)V{q; a, b); these inequalities can 
be proved using an argument similar to the calculation in equation ( |5.13[ ), but employing the more 
precise estimate log Jo{z) = —z^ /A — 2:^/64 + 0(0.003862;^) for \z\ < 2. Using the methods of 
Section 13. 41 we also obtain the formula 



U{q;a,b) = —i3 + ,,iab )) i^og - — - — 



. (5.21) 



If we define /t(g; a, b) = min(^, V{q; a, b) ^/^), then we know that K{q; a,b) < ^ because of 

the lower bound V{q; a, b) > 531, and also that (sin p{q)x)/x is nonnegative for < ^(g; a, b). 
Hence, equation ( |5.20[ ) and the subsequent discussion establishes the following proposition: 

Proposition 5.20. Assume GRH and LI, and let a be a nonsquare (mod q) and b a square (mod q). 
IfV{q;a,b) > 531, then 

^ ^ ^ Z"'"^'''"'^'' Smp(g)x^_y(^.„;,)^.2/2-i7(g;a,6)x4-15.816i7(g;a,6)^^ 



2 271 .hf,(^q.a,b) X 



< S{q; a,b)<- + — r^'''^'"^ Pi^)^ ^-V(,;a,t).y2-Uig;a,b).^ ^ y [q- a, 6), 
2 27r J^^(q.a,b) X 
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where 



^ J K{q;a,b) 



„-9.08(/>(g) 

+ 0.03506 , . + 63.67p(g)e-25Vtaa,6)/1152-(5/24)^l/taa,6) 



0(g) 



and formulas for V{q] a, h) and U (g; a, 6) are ^/ven in Theorem 1.4 and equation ( 5.21 1, respec- 
tively. 



The inequalities in Proposition 5.20 give accurate evaluations of a, 6) when (j){q) is large; 



we chose the inequality 0(g) > 80 to be our working definition of "large". For each of the moduli 
q in the set 5*3 U 5*4, we computed every possible value of V{q] a, h) and verified that they all exceed 
531, so that Proposition |5 . 20| can be used. (The reason that the moduli in 5*2 were calculated using 
the first method, rather than this one, is because at least one variance V{q] a, h) was less than 531 



for each of the moduli in 82-) We then calculated the upper and lower bounds of Proposition 5.20 



using numerical integration in pari/gp, to obtain all values of 6{q;a,b). The calculation of 
V{q;a,b) and U{q;a,b) involve the analytic terms L{l,x), L'{l,x)^ and L"(l,x); we used the 
pari/gp package computeL (see [4J) to obtain these values accurate to 16 decimal places. 

Table |7] gives a sample of the data we calculated with this second method, including the error 
bounds obtained. The error bounds are stronger for when q and 0(g) are large, explaining why the 
error bounds for the large prime g = 997 are so much better than for the smaller composite number 
g = 244. 



Table 7. The 20 smallest values of 5(244; a, 1) and of 5(997; a, 1), calculated 



using Proposition 5.20 



q 


a 


a-' 


S{q;a, 1) 


Error bound 




q 


a 


a-' 


6{q;a, 1) 


Error bound 


244 


243 


243 


0.558910 


0.000022 




997 


2 


499 


0.508116457 


0.000000014 


244 


123 


123 


0.559000 


0.000018 




997 


5 


399 


0.508142372 


0.000000015 


244 


3 


163 


0.562304 


0.000020 




997 


7 


285 


0.508184978 


0.000000015 


244 


7 


35 


0.563216 


0.000022 




997 


11 


272 


0.508238549 


0.000000016 


244 


31 


63 


0.563543 


0.000022 




997 


17 


176 


0.508279881 


0.000000016 


244 


153 


185 


0.563804 


0.000021 




997 


29 


722 


0.508329803 


0.000000016 


244 


11 


111 


0.564069 


0.000024 




997 


37 


512 


0.508345726 


0.000000016 


244 


29 


101 


0.564124 


0.000024 




997 


41 


535 


0.508351018 


0.000000016 


244 


17 


201 


0.564321 


0.000023 




997 


8 


374 


0.508353451 


0.000000016 


244 


33 


37 


0.564436 


0.000024 




997 


43 


371 


0.508355411 


0.000000016 


244 


19 


167 


0.564741 


0.000024 




997 


47 


297 


0.508358709 


0.000000016 


244 


23 


191 


0.564786 


0.000023 




997 


61 


474 


0.508368790 


0.000000016 


244 


107 


187 


0.565310 


0.000024 




997 


163 


367 


0.508392448 


0.000000016 


244 


69 


145 


0.565319 


0.000022 




997 


103 


242 


0.508392587 


0.000000016 


244 


53 


221 


0.565376 


0.000022 




997 


113 


150 


0.508395577 


0.000000016 


244 


85 


89 


0.565606 


0.000022 




997 


181 


661 


0.508397690 


0.000000016 


244 


129 


157 


0.565683 


0.000021 




997 


127 


840 


0.508402416 


0.000000016 


244 


173 


189 


0.565707 


0.000023 




997 


157 


870 


0.508404812 


0.000000016 


244 


177 


193 


0.565859 


0.000023 




997 


283 


613 


0.508406794 


0.000000016 


244 


103 


199 


0.565861 


0.000024 




997 


179 


518 


0.508406994 


0.000000016 
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We also take this opportunity to reinforce the patterns described in Section 4. 1 For q = 244, the 
residue class a = 123 has the property that q/{q, a — 1) = 2; thus the contribution of _ft'244(122) 
to A(244; 123, 1) reduces the density 5(244; 123, 1). We see also the familiar small densities cor- 
responding to a = 243 = — 1 (mod 244) and to small prime values of a. For q = 997, the small 
prime values of a (among those that are nonsquares modulo 997) appear in perfect order. We point 
out that the residue class a = 8 is almost in its correct limiting position, since the contribution to 
A(997; a, 1) is inversely correlated to ^, and ^ > ^ > ^ > 

We mention that we undertook the exercise of calculating values 5{q; a, h) by both methods, for 
several intermediate values of q, as a way to verify our computations. For example, the calculations 
of (5(163; a, 6) (see Table |3]) were done using the integral formula ( |2.10[ ) as described above. We 
calculated these same densities using Proposition |5.20[ the error bounds obtained were all at most 
4.6 X 10^^, and the results of the first calculation all lay comfortably within the intervals defined 
by the second calculation. 

Finally, the upper bounds for 5(g; a, 6) in Theorems 5.17 and 5.19 together with the explicit 
calculation of the densities 5(g; a, h) for g G U 5*2 U 5*3 U 5*4, allow us to determine the most 
biased possible two-way races, that is, the largest values of 5(g; a, h) among all possible choices 
of g, a, and h. Inparticular, we verified Theorem |1.11| in this way, and we list the 120 largest 
densities in Table [S, there are precisely 117 distinct densities above ^. (It is helpful to recall here 
that (5(g; a, 1) = 5{q] a~\ 1) and that 5(g; a, 1) = 5(g; ah, b) for any nonsquare a and square b 
modulo q.) 
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